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SPECIFICATION 



TO ALL WHOM IT MAY CONCERN: 

Be it known that we, Susan Lindquist a citizen of the United States of 
America, residing at 1200 East Madison Park, Chicago, 60615 in the and State of 
Illinois and Liming Li a citizen of China, residing at 1457 East Park, Chicago, 60637, 
in the State of Illinois and Jiyan Ma a citizen of China, residing at 1900 West 
Harrison Street Apt. 916, Chicago, 60612, in the State of Illinois and Jia-Jia Liu a 
citizen of China, residing at 1645 East 50"^ Street, #14J, Chicago, 60615, in the 
State of Illinois and Neal Sondheimer, a citizen of the United States of America, 
residing at 5722 South Stoney Island Avenue, #3, Chicago, 60637, in the State of 
Illinois and Thomas Scheibel, a citizen of Germanyj:esiding_at_5469^South-Gornell— 
Chicago, 60615, in the State of Illinois have invented a new and useful 
RECOMBINANT PRION-LIKE GENES AND PROTEINS AND MATERIALS AND 
METHODS COMPRISING SAME, of which the following is a specification. 
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RECOMBINANT PRION-LIKE GENES AND PROTEINS 
AND MATERIALS AND METHODS COMPRISING SAME 



This application claims priority benefit of United States Provisional 
Application No. 60/138,833, filed June 9, 1999, incorporated herein by reference. 
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certain rights in this invention. 



FIELD OF THE INVENTION 
10 The present invention relates generally to the fields of genetics and cellular 

and molecular biology. More particularly, the invention relates to amyloid or fibril- 
forming proteins and the genes that encode them, and especially to prion-like proteins and 
protein domains and the genes that encode them. 



DESCRIPTION OF RELATED ART 
1 5 Prions (protein infectious particles) have been implicated in both human 

and animal spongiform encephalopathies, including Creutzfeldt-Jakob Disease, kuru, 
Gerstmann-Strassler-Scheinker Disease, and fatal familial insomnia in humans; the 
recently-publicized "mad cow disease" in bovines; "scrapie," which afflicts sheep and 
goats; transmissible mink encephalopathy; chronic wasting disease of mule, deer, and elk; 
20 and feline spongiform encephalopathy. See generally S. Prusiner et ai, Cell, 93: 337-348 
(1998); S. Prusiner, Science, 278:245-251 (1997); and A. Horwich and J. Weissman, Ce//, 
89: 499-510 (1997). A currently-accepted theory is that a prion protein (PrP) can exist in 
at least two conformational states: a normal, soluble cellular form (PrP^) containing little 
p-sheet structure; and a "scrapie" form (PrP^*") characterized by significant p-sheet 
25 structure, insolubility, and resistance to proteases. Prion particles comprise multimers of 

the-Erl^form.— Prion-formatiGn-has-been-Gompared-and-eontrasted-to-amylGid-fibril 

formation that has been observed in other disease states, such as Alzheimer's disease. 
See J. Harper & P. Lansbury, Amm. Rev. Biochem, 66: 385-407 (1997). More generally, 
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the prion protein has been loosely classified (despite "some significant differences") as 
one of at least sixteen known human amyloidogenic proteins that, in an altered 
conformation, assemble into a fibril-like structure. See J.W. Kelly, Cwrr. Opin, Struct. 
BioL, 6: 11-17 (1996), incorporated herein by reference. 
5 There is growing patent and journal literature relating to scientists efforts 

to develop diagnostic, therapeutic, and prophylactic advances in the area of prion disease. 
For example, Fishleigh et al, U.S. Patent No. 5,773,572 describes synthetic peptides that 
have at least one antigenic site of a prion protein, and suggest using such peptides to raise 
antibodies and to create vaccines. Prusiner et aL, U.S. Patent No. 5,750,361 describes 
10 prion protein peptides having at least one a-helical domain and forming a random coil 
conformation in aqueous medium, and suggests using such a peptide to assay for the 
.a scrapie form of prion protein (PrP^^). 

Jj:; Weiss et al, J, Virology, 69(8): 4776-83 (1995) state that isolation of PrP*^ 

H from organisms has been a time-consuming and labor-intensive process. The authors 

1^1 1 5 purport to describe the synthesis of Syrian golden hamster prion protein as a fusion with 
' " glutathione S-transferase (GST) to enhance solubility and stability of PrP*^, and the release 

S of PrP^ from the fusion protein via thrombin cleavage. The authors report that only the 

Q cellular isoform PrP*^, and not the infectious PrP^'^ isoform, was produced. [See also 

% Volkel et al, Eur, J. Biochem, 257:462-471 (1998); Meeker et aL, Proteins: Structure, 

Q 20 Function, and Genetics, 30: 381-387 (1998) (Describing system to overexpress a fusion 
between the small, minimally soluble serum amyloid A protein and the bacterial enzyme 
Staphylococcal nuclease; and Zahn et aL, FEBS Lett., 417(3): 400-404 (1997) (reporting 
expression of human PrP proteins fused to a histidine tail to facilitate refolding).] 

Prusiner et al., U.S. Patent Nos. 5,792,901, 5,789,655, and 5,763,740 
25 describe a transgenic mouse comprising a prion protein gene that includes codons from a 
PrP gene that is native to a different host organism, such as humans, and suggest uses of 
such mice for prion disease research. The '655 patent teaches to incorporate "a strong 
epitope tag" in the PrP nucleotide sequence to permit differentiation of PrP protein 
conformations using an antibody to the epitope. The patents describing these native, 
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fusion between a putative nuclear localization signal of PrP and a green fluorescent 
protein to study targeting of the protein to the nuclear compartment. 

Weissmann et al., U.S. Patent No. 5,698,763, describes a transgenic mouse 
in which the PrP gene has been disrupted by homologous recombination, allegedly 
5 rendering the mouse non-susceptible to spongiform encephalopathies. Use of PrP anti- 
sense oligonucleotides to treat non-transgenic animals suffering from an incipient 
spongiform encephalopathy also is suggested. 

Cashman et aL, International Publication No. WO 97/45746, purports to 
describe prion protein binding proteins and uses thereof, e.g., to detect and treat prion- 

10 related diseases or to decontaminate samples known to contain or suspected of containing 
prion proteins. The authors also purport to describe a fusion protein having a PrP portion 
and an alkaline phosphatase portion, for use as an affinity reagent for labeling, detection, 
identification, or quantitation of PrP binding proteins or PrP^'^'s in a biological sample, or 
for use to facilitate the affinity purification of PRP binding proteins. 

15 In addition, there has been significant research in recent years concerning 

the biology of prion-like elements in yeast. [See, e.g.^ V. Kushnirov and M. Ter- 
Avanesyan, Cell, 94: 13-16 (1998); S. Lindquist, Cell, 89: 495-498 (1997); DePace etal. 
Cell, 93: 1241-1252 (1998); andR. Wickner, Annu. Rev. Genet., 30.109-139 (1996) (all 
incorporated herein by reference).] Although the two yeast prion-like elements that have 

20 been extensively studied do not spread from cell to cell (except during mating or from 

mother-to-daughter cell) and do not kill the cells harboring them, as has been observed in 
the case of mammalian PrP prion diseases, certain heritable yeast phenotypes exist that 
display a very "prion-like" character. The phenotypes appear to arise as the result of the 
ability of a "normal" yeast protein that has acquired an abnormal conformation to 

25 influence other proteins of the same type to adopt the same conformation. Such 

phenotypes include the [PST^] phenotype, which enhances the suppression of nonsense 
codons, and the [URE3] phenotype, which interferes with the nitrogen-mediated 
repression of certain catabolic enzymes. Both phenotypes exhibit cytoplasrhic inheritance 
by daughter cells from a mother cell and are passed to a mating partner of a \PSJ^^jot 



30 [URE3] cell. 
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Yeast organisms present, in many respects, far easier systems than 
mammals in which to study genotype and phenotype relationships, and the study of the 
[PST^] and [URE3] phenotypes in yeast has provided significant valuable information 
regarding prion biology. Studies have implicated the Sup35 subunit of the yeast 
5 translation termination factor and the Ure2 protein that antagonizes the action of a 
nitrogen-regulated transcription activator in the [PSr] and [URE3] phenotypes, 
respectively. In both of these proteins, the above-stated "nomial" biological functions 
reside in the carboxy-terminal domains, whereas the dispensable, amino-terminal domains 
have unusual compositions rich in asparagine and glutamine residues. 
10 It is the amino-terminal domains of these proteins (e.g., no more than 

about residues 2-113 of Sup35 and about residues 1-65 of Ure2) that have been 
^ implicated in conferring the and [URE3] phenotypes in a prion-like manner. King 

et al. . Proc. Natl Acad Sci USA, 94:661 8-6622 ( 1 997), purportedly expressed the N- 
M terminal 114 residues of SUP35 (with a cleavable polyhistidine tag for purification) and 

hi 15 reported that this peptide spontaneously aggregates to form thin filaments showing a P- 
^ sheet-type circular dichroism in vitro. Deletion of the amino termini of Sup35 and Ure2 
Q in yeast eliminates the [PSf^] and [URE3] phenotypes, respectively. In contrast, over- 

Pi expression of these proteins, or of their amino-terminal fi-agments, can induce the [PSf^] 

or [URE3] phenotype de novo. Once cells have acquired the [PSI^] or [URE3] phenotype 
13 20 in this manner, they continue to pass the trait to their progeny, even after the plasmid 

containing the over-expressed element is lost. [See Derkatch et al., Genetics, J 44:1375- 
1386(1996).] 

Interestingly, the Sup35 protein contains similarities to mammalian PrP 
proteins in that Sup35 is soluble in strains but prone to aggregate into insoluble, 

25 protease-resistant aggregates in [PSJ^] strains. In experiments using a fusion between the 
Sup35 amino terminus and green fluorescent protein (GFP, a protein that fluoresces green 
on exposure to blue light), it has been shown that the fusion protein is freely distributed in 
[psi-] cells but aggregated in [PSf] cells. See, e.g.. Glover et ciL, Cell, 89: 81 1-819 

(-l-997-);-and-Patino e/-fl/^5G/e«Ge^271:-622-626 (J^^^ 

30 shock proteins," such as the protein Hspl04 in yeast, have been implicated in the 

conformational conversion of Sup35 protein that is associated with the [PSt] phenotype 
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[see, e.g., J. Glover and S. Lindquist, Cell, 94: 73-82 (1998); V. Kushnirov and M. Ter- 
Avanesyan, Cell, 9^:13-16 (1998); Y.O.Chemoff a/.. Science, 268: 880-883 (1995)], 
and may be implicated in the conformational conversion of PrP. See, e,g., E. Schirmer 
and S. Lindquist, Proa Natl. Acad. ScL USA, 94: 13932-13937 (1997); S. DebBurman et 
5 ai, Proc. Natl. Acad. ScL USA^ 94: 13938-13943 (1997). 

As the foregoing discussion of literature indicates, there has been 
significant investigation into the biology of mammalian prions and prion-like yeast 
proteins for the purposes of developing a basic understanding of prion biology and 
developing effective measures for diagnosing, treating, and preventing mammalian prion 
10 diseases. Practical applications for prion and prion-like gene and proteins, in addition to 
the immediate medical implications of diagnosing, treating, and preventing spongiform 
encepalopathies and other amyloid diseases, is lacking. 

SUMMARY OF THE INVENTION 
The present invention is believed to be the first invention directed to 
15 employing unique features of prion biology in a practical context beyond fundamental 
prion research and applied research directed to the development of diagnostic, 
therapeutic, and prophylactic treatments of mammalian prion diseases (although aspects 
of the invention have utility in such contexts also). Likewise, the present invention is 
believed to be the first invention relating to the construction of novel prion-like elements 
20 that can change the phenotype of a cell in a beneficial way. 

In one aspect, the invention provides a polynucleotide comprising a 
nucleotide sequence that encodes a chimeric polypeptide, the polynucleotide comprising: 
a nucleotide sequence encoding at least one SCHAG amino acid sequence fused in frame 
with a nucleotide sequence encoding at least one polypeptide of interest other than a 
25 marker protein, or a glutathione S-transferase (GST) protein, or a staphylococcal nuclease 
protein. In a preferred embodiment, the polynucleotide has been purified and isolated. In 
another preferred embodiment, the polynucleotide is stably transformed or transfected 

into a livin g cell. ^ 

By "chimeric polypeptide" is meant a polypeptide comprising at least two 
30 distinct polypeptide segments (domains) that do not naturally occur together as a single 
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protein. In preferred embodiments, each domain contributes a distinct and useful property 
to the polypeptide. Polynucleotides that encode chimeric polypeptides can be constructed 
using conventional recombinant DNA technology to synthesize, amplify, and/or isolate 
polynucleotides encoding the at least two distinct segments, and to ligate them together. 
5 See, e.g., Sambrook et al.. Molecular Cloning - A Laboratory Manual^ Second Ed., Cold 
Spring Harbor Press (1989); and Ausubel et aL, Current Protocols in Molecular Biology, 
John Wiley & Sons, Inc. (1998); both incorporated herein by reference. 

The chimeric polypeptide comprises a SCHAG amino acid sequence as 
one of its polypeptide segments. By " SCHAG amino acid sequence" is meant any amino 

10 acid sequence which, when included as part or all of the amino acid sequence of a protein, 
can cause the protein to coalesce with like proteins into higher ordered aggregates 
commonly referred to in scientific literature by terms such as "amyloid," "amyloid 
fibers," "amyloid fibrils," "fibrils," or "prions." In this regard, the term SCHAG is an 
acronym for Self-Coalesces into Higher-ordered AGg re gates. By "higher ordered" is 

15 meant an aggregate of at least 25 polypeptide subunits, and is meant to exclude the many 
proteins that are known to comprise polypeptide dimers, tetramers, or other small 
numbers of polypeptide subunits in an active complex. The term "higher-ordered 
aggregate" also is meant to exclude random agglomerations of denatured proteins that can 
form in non-physiological conditions. [From the term "self-coalesces," it will be 

20 understood that a SCHAG amino acid sequence may be expected to coalesce with 

identical polypeptides and also with polypeptides having high similarity (e.g., less than 
10% sequence divergence) but less than complete identity in the SCHAG sequence.] It 
will be understood than many proteins that will self-coalesce into higher-ordered 
aggregates can exist in at least two conformational states, only one of which is typically 

25 found in the ordered aggregates or fibrils. The term "self-coalesces" refers to the property 
of the polypeptide to form ordered aggregates with polypeptides having an identical 
amino acid sequence under appropriate conditions as taught herein, and is not intended to 
imply that the coalescing will naturally occur under every concentration or every set of 
conditions. In fact, data exist s suggesting that trans-?LCim ^ factors, such as chap.erone 

30 proteins, may be involved in the protein's conformational switching, in vivo.) Aggregates 
formed by SCHAG polypeptides typically are rich in P-sheet structure, as demonstrated 
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by circular dichroism; bind Congo red dye and give a characteristic spectral shift in 
polarized light; and are insoluble in water or in solutions mimicking the physiological salt 
concentrations of the native cells in which the aggregates originate. In preferred 
embodiments the SCHAG polypeptides self-coalesce to form amyloid fibrils that typically 
5 are 5-20 nm in width and display a "cross-P" structure, in which the individual P strands 
of the component proteins are oriented perpendicular to the axis of the fibril. The 
SCHAG amino acid sequence may be said to constitute an "amyloidogenic domain" or 
"fibril-aggregation domain" of a protein because a SCHAG amino sequence confers this 
self-coalescing property to proteins which include it. 

10 Exemplary SCHAG amino acid sequences include sequences of any 

naturally occurring protein that has the ability to aggregate into amyloid-type ordered 
aggregates under physiological conditions, such as inside of a cell. In one preferred 
embodiment, the SCHAG amino acid sequence includes the sequences of only that 
portion of the protein responsible for the aggregation behavior. Many such sequences 

15 have been identified in humans and other animals, including amyloid P protein (residues 
1-40, 1-41, 1-42, or 1-43), associated with Alzheimer's disease; immunoglobulin light 
chain fragments, associated with primary systemic amyloidosis; serum amyloid A 
fi-agments, associated with secondary systemic amyloidosis; transthyretin and 
transthyretin fragments, associated with senile systemic amyloidosis and familial amyloid 

20 polyneuropathy I; cystatin C fi-agments, associated with hereditary cerebral amyloid 
angiopathy; P2-microglobulin, associated with hemodialysis-related amyloidosis; 
apolipoprotein A-1 fragments, associated with familial amyloid polyneuropathy HI; a 71 
amino acid fragment of gelsolin, associated with Finnish hereditary systemic amyloidosis; 
islet amyloid polypeptide fragments, associated with Type 11 diabetes; calcitonin 

25 fi-agments, associated with medullary carcinoma of the thyroid; prion protein and 

fragments thereof, associated with spongiform encephalopathies; atrial natriuretic factor, 
associated with atrial amyloidosis; lysozyme and lysozyme fragments, associated with 
hereditary non-neuropathic systemic amyloidosis; insulin, associated with injection- 
localized amyloidosis; and fibrinogen fragments, associated with hereditary renal 

30 amyloidosis:~See J.WrKelly, Curr. Op. Struct. Biol., 6: 11-17 (1996), incorporated herein 

by reference. In addition, several other SCHAG amino acid sequences of yeast and fungal 
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origin are described in detail below. Also, the Examples below set forth in detail how to 
use the SCHAG sequences specifically identified herein or elsewhere in the literature to 
screen databases or genomes for additional naturally occurring SCHAG amino acid 
sequences. The Examples also provide assays to screen candidate SCHAG sequences for 
5 prion-like properties. In addition, the Examples provide assays to rapidly screen random 
DNA fragments to determine whether they encode a SCHAG amino acid sequence. Such 
screening assays are themselves considered aspects of the invention. 

In addition, SCHAG amino acid sequences include those sequences 
derived from naturally occurring SCHAG amino acid sequences by addition, deletion, or 

10 substitution of one or more amino acids from the naturally occurring SCHAG amino acid 
sequences. Detailed guidelines for modifying SCHAG amino acid sequences to produce 
synthetic SCHAG amino acid sequences are described below. Modifications that 
introduce conservative substitutions are specifically contemplated for creating SCHAG 
amino acid sequences that are equivalent to naturally occurring sequences. By 

15 "conservative amino acid substitution" is meant substitution of an amino acid with an 
amino acid having a side chain of a similar chemical character. Similar amino acids for 
making conservative substitutions include those having an acidic side chain (glutamic 
acid, aspartic acid); a basic side chain (arginine, lysine, histidine); a polar amide side 
chain (glutamine, asparagine); a hydrophobic, aliphatic side chain (leucine, isoleucine, 

20 valine, alanine, glycine); an aromatic side chain (phenylalanine, tryptophan, tyrosine); a 
small side chain (glycine, alanine, serine, threonine, methionine); or an aliphatic hydroxyl 
side chain (serine, threonine). Altematively, similar amino acids for making conservative 
substitutions can be grouped into three categories based on the identity of the side chains. 
The first group includes glutamic acid, aspartic acid, arginine, lysine, histidine, which all 

25 have charged side chains; the second group includes glycine, serine, threonine, cysteine, 
tyrosine, glutamine, asparagine; and the third group includes leucine, isoleucine, valine, 
alanine, proline, phenylalanine, tryptophan, methionine, as described in Zubay, G., 
Biochemistry , third edition, Wm.C. Brown Publishers (1993). 

Also contemplated are modifications to naturally oc curring SCHAG amino 

30 acid"sequences that result in addition or substitution of polar residues (especially 

glutamine and asparagine, but also serine and tyrosine) into the amino acid sequence. 
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Certain naturally occurring SCHAG amino acid sequences are characterized by short, 
sometimes imperfect repeat sequences of, e.g., 5-12 residues. Modifications that result in 
substantial duplication of such repetitive oligomers are specifically contemplated for 
creating SCHAG amino acid sequences, too. 
5 hi another variation of the invention, the SCHAG amino acid sequence is 

encoded by a polynucleotide that hybridizes to any of the nucleotide sequences of the 
invention; or the non-coding strands complementary to these sequences, under the 
following exemplary moderately stringent hybridization conditions: 

(a) hybridization for 16 hours at 42°C in an aqueous hybridization solution 
10 comprising 50% formamide, 1% SDS, 1 M NaCl, 10% Dextran sulphate; and 

(b) washing 2 times for 30 minutes at 60°C in an aqueous wash solution 
comprising 0.1% SSC, 1% SDS. Alternatively, highly stringent conditions include 
washes at 68°C. 

Also provided are purified and isolated polynucleotide comprising a 

15 nucleotide sequence that encodes at least one SCHAG amino acid sequence, wherein the 
SCHAG-encoding portion of the polynucleotide is at least about 99%, at least about 98%, 
at least about 95%, at least about 90%>, at least about 85%, at least about 80%, at least 
about 75%, or at least about 70% identical over its full length to one of the nucleotide 
sequences of the invention. Methods of screening for natural or artificial sequences for 

20 SCHAG properties are also described elsewhere herein. 

A preferred category of SCHAG amino acid sequences are prion 
aggregation domains from prion proteins. The term "prion-aggregation domain" is 
intended to define a subset of SCHAG amino acid sequences that can exist in at least two 
conformational states, only one of which is typically found in the aggregated state. In one 

25 conformational state, proteins comprising the prion-aggregation domain or ftised to the 
prion-aggregation domain perform their normal fiinction in a cell, and in another 
conformational state, the native proteins form aggregates (prions) that phenotypically alter 
the cell, perhaps by sequestering the protein away from its normal site of subcellular 
activity, or by disrupting the conformation of an a ctivejdpmain-of^the-protein,-or-bv-- 

30 changing its activity state, or bay acquiring a new activity upon aggregation, or perhaps 
merely by virtue of a detrimental effect on the cell of the aggregate itself A hallmark 
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feature of prion-aggregation domains is that the phenotypic alteration that is associated 
with prion formation is heritable and/or transmissible: prions are passed from mother to 
daughter cell or to mating partners in organisms such as in the case of yeast Sup35, and 
Ure2 prions, perpetuating the [PSf] or [UR£3] prion phenotypes, or the prions are 
5 transmitted in an infectious manner in organisms such as in the case of PrP prions in 
mammals, leading to transmissible spongiform encephalopathies. This defining 
characteristic of prions is attributable, at least in part, to the fact that the aggregated prion 
protein is able to promote the rearrangement of unaggregated protein into the aggregated 
conformation (although chaperone-type proteins or other trans-acting factors in the cell 
10 may also assist with this conformational change). It is hkewise a feature of prion- 
aggregation domains that over-production of proteins comprising these domains increases 
the frequency with which the prion conformation and phenotype spontaneously arises in 
cells. 

Prion aggregation amino acid sequences comprising amino terminal 

15 sequences derived from yeast or fungal Sup35 proteins, Ure2 proteins, or the carboxy 
terminal sequences derived from yeast Rnql proteins are among those that are highly 
preferred. Referring to the S. cerevisiae Sup35 amino acid sequence set forth in SEQ ID 
NO: 2, experiments have shown that no more than amino acids 2-113 (the N domain) of 
that sequence are required to confer some prion aggregation properties to a protein, 

20 although inclusion of the charged "M" (middle) region immediately downstream of these 
residues, e.g^., thru residue 253, is preferred in some embodiments. The N domain alone 
is very amyloidogenic and immediately aggregates into fibers, even in the presence of 2 
M urea, a phenomenon that is desirable in embodiments of the invention where formation 
of stable fibrils of chimeric polypeptides is preferred. When the N domain is fused to the 

25 highly charged M domain, fiber formation proceeds in a slower, more orderly way. The 
M domain is postulated to shift the equilibrium to permit greater "switchability" between 
aggregated and soluble forms, and is preferably included where phenotypic switching is 
desirable. Referring to the S. cereviciae Ure2 amino acid sequence set forth in SEQ ED 
NO: 4, experiments have shown that no mo re than amino acids 2-6 5 of that_sequence~are— 

30 required to confer prion aggregation activity to a protein. Referring to the S. cereviciae 
Rnql amino acid sequence set forth in SEQ ID NO: 50, experiments have shown that no 
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more than amino acids 153-405 of that sequence are required to confer prion aggregation 
activity to a protein. Moreover, sequences differing from the native sequences by the 
addition, deletion, or substitution of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, or more amino acids, especially the addition or substitution of additional 
5 glutamine or asparagine residues, but which retain the properties of prion- aggregation 
domains as described in the preceding paragraph, are contemplated. Also, orthologs 
(corresponding proteins or prion aggregation domains thereof from different species) 
comprise an additional genus of preferred sequences (Kushinov et al., Yeast 5:461-472 
(1990); Chemoff aL, Mol Microbiol 35:865-876 (2000); Santoso et al. Cell 100:211- 

10 288 (2000); and Kushinov e/ ^/., EMBO J 19:32^-31 (2000)). By way of example, Sup35 
amino acid sequences from Pichia pinus and Candida albicans are set forth in Genbank 
Accession Nos. X56910 (SEQ ID NO: 46) and AF 020554 (SEQ ED NO: 47), 
respectively. Polypeptides of the invention include polypeptides that are encoded by 
polynucleotides that hybridize under stringent, preferably highly stringent conditions, to 

1 5 the polynucleotide sequences of the invention, or the non-coding strand thereof. 

Polypeptides of the invention also include polypeptides that are at least about 99%, at 
least about 98%, at least about 95%>, at least about 90%, at least about 85%), at least about 
80%, at least about 75%, or at least about 70% identical to one of SCHAG amino acid 
sequences of the invention. 

20 As set forth above, in some aspects of the invention, the nucleotide 

sequence encoding the SCHAG amino acid sequence of the polypeptide is fiised in frame 
with a nucleotide sequence encoding at least one polypeptide of interest. By "in frame" is 
meant that when the nucleotide is transformed into a host cell, the cell can transcribe and 
translate the nucleotide sequence into a single polypeptide comprising both the SCHAG 

25 amino acid sequence and the at least one polypeptide of interest. It is contemplated that 
the nucleotide sequences can be joined directly; or that the nucleotide sequences can be 
separated by additional codons. Such additional codons may encode an endopeptidase 
recognition sequence or a chemical recognition sequence or the like, to permit enzymatic 
or chemical cleavage of the SCHAG amino acid se quence from the polvBeutide-of 

30 interest, to permit isolation of the polypeptide of interest. Preferred recognition sequences 
are sequences that are not found in the polypeptide of interest, so that the polypeptide of 
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interest is not internally cleaved during such isolation procedures. It will be understood 
that modification of the polypeptide of interest to eliminate internal recognition sequences 
may be desirable to facilitate subsequent cleavage from the SCHAG amino acid sequence. 
Suitable enzymatic cleavage sites include: the amino acid sequences -(Asp)„-Lys-, 
wherein n signifies 2, 3 or 4, recognized by the protease enterokinase; -Ile-Glu-Gly-Arg-, 
recognized by coagulation factor X^^; an arginine residue or a lysine residue cleaved by 
trypsin; a lysine residue cleaved by lysyl endopeptidase; a glutamine residue cleaved by 
V8 protease, and a glu-asn-leu-tyr-phe-gln-gly site recognized by the tobacco etch virus 
(TEV) protease. Suitable chemical cleavage sites include tryptophan residues cleaved by 
3-bromo-3-methyl-2-(2-nitrophenylmercapto)-3H-indole; cysteine residues cleaved by 2- 
nitroso-5-thiocyano benzoic acid; the dipeptides -Asp-Pro- or -Asn-Gly- which can be 
cleaved by acid and hydroxylamine, respectively; and a methionine residue which is 
specifically cleaved by cyanogen bromide (CNBr). In another variation, the additional 
codons comprise self-splicing intein sequences that can be activated, e.g.^ by adjustments 
topH. See Chong a/.. Gene, 792:27-281 (1997). 

Additional codons also may be included between the sequence encoding 
the prion aggregation amino acid sequence and the sequence encoding the protein of 
interest to provide a linker amino acid sequence that serves to spatially separate the 
SCHAG amino acid sequence from the polypeptide of interest. Such linkers may 
facilitate the proper folding of the polypeptide of interest, to assure that it retains a desired 
biological activity even when the protein as a whole has formed aggregates with other 
proteins containing the SCHAG amino acid sequence. Also, additional codons may be 
included simply as a result of cloning techniques, such as ligations and restriction 
endonuclease digestions, and strategic introduction of restriction endonuclease 
recognition sequences into the polynucleotide. 

In still another variation, the additional codons comprise a hydrophilic 
domain, such as the highly-charged M region of yeast Sup35 protein. While the N 
domain of Sup35 has proven sufficient in some cases to effect prion-like behavior, 
suggesting that the M region is not absolutely required in all cases, it is cmitemplated-that- 
the-M-regioirbTa~aifferent peptide that includes hydrophilic amino acid side chains will 
in some cases be helpfiil for modulating prion-like character of chimeric peptides of the 
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invention. Without intending to be limited to a particular theory, the highly charged M 
domain is thought to act as a "solublization" domain involved in modulating the 
equilibrium between the soluble and the aggregate forms of Sup35, and these properties 
may be advantageously adapted for other SCHAG sequences. 
5 By "polypeptide of interest" is meant any polypeptide that is of 

commercial or practical interest and that comprises an amino acid sequence encodable by 
the codons of the universal genetic code. Exemplary polypeptides of interest include: 
enzymes that may have utility in chemical, food-processing (eg., amylases), or other 
commercial applications; enzymes having utility in biotechnology applications, including 

10 DNA and RNA polymerases, endonucleases, exonucleases, peptidases, and other DNA 
and protein modifying enzymes; polypeptides that are capable of specifically binding to 
compositions of interest, such as polypeptides that act as intracellular or cell surface 
receptors for other polypeptides, for steroids, for carbohydrates, or for other biological 
molecules; polypeptides that comprise at least one antigen binding domain of an antibody, 

15 which are useful for isolating that antibody's antigen; polypeptides that comprise the 
ligand binding domain of a ligand binding protein (e,g,, the ligand binding domain of a 
cell surface receptor); metal binding proteins (e,g,^ ferritin (apoferritin), metallothioneins, 
and other metalloproteins), which are useful for isolating/purifying metals from a solution 
containing them for metal recovery or for remediation of the solution; light-harvesting 

20 proteins (e.g., proteins used in photosynthesis that bind pigments); proteins that can 

spectrally alter light (e.g., proteins that absorb light at one wavelength and emit light at 
another wavelength); regulatory proteins, such as transcription factors and translation 
factors; and polypeptides of therapeutic value, such as chemokines, cytokines, 
interleukins, growth factors, interferons, antibiotics, immunopotentiators and 

25 immunosuppressors, and angiogenic or anti-angiogenic peptides. 

However, specifically excluded from the scope of the invention are 
chimeric polynucleotides that have heretofore been described in the literature. For 
example, excluded from the scope of the invention are polynucleotides encoding a fusion 
consisting essentially of a SCHAG domain of a charact erized protein-fused Jn-frame-to 

30 only: ( 1 ) a marker protein such as a fluorescing protein (e.g., green fluorescent protein or 
firefly luciferase), an antibiotic resistance-conferring protein, a protein involved in a 
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nutrient metabolic pathway that has been used in the literature for selective growth on 
incomplete growth media, or a protein (e.g., p-galactosidase, an alkaline phosphatase, or 
a horseradish peroxidase) involved in a metabolic or enzymatic pathway of a 
chromogenic or luminescent substrate that results in the production of a detectable 
5 chromophore or light signal that has been used in the literature for identification, 
selection, or quantitation; or (2) a protein (e.g., glutathione S-transferase or 
Staphylococcal nuclease) that has been used in the hterature as a fusion partner for the 
express purpose of facihtating expression or purification of other proteins. 
Notwithstanding this exclusion of certain products from the invention, the inventors 

10 contemplate novel uses of such specifically excluded products as aspects of the present 
invention. Moreover, polynucleotides that include a SCHAG sequence, and sequence 
encoding a polypeptide of interest, and a sequence encoding a marker protein such as 
green fluorescent protein are considered within the scope of the invention. Also, 
notwithstanding the above exclusion, polynucleotides that encode polypeptides whose 

15 SCHAG properties are described herein for the first time, fiised to a marker protein, are 
considered within the scope of the invention. Also, purified fiasion polypeptides that have 
been described in the literature and examined only in vivo, but never purified, are 
intended as aspects of the invention. For example, isolated fibers comprising 
polypeptides encoding a fusion protein consisting of essentially one or more SCHAG 

20 sequences fused to a marker protein, e.g., GFP are contemplated. Several such examples 
are provided in Example 5. 

The encoding sequences of the polynucleotide maybe in either order, i,e,, 
the SCHAG amino acid encoding sequence may be upstream (5') or downstream (3') of 
the sequence, such that the SCHAG amino acid sequence of the resultant protein is 

25 disposed at an amino-terminal or carboxyl-terminal position relative to the protein of 
interest. In the case of SCHAG amino acid sequences identified or derived from 
sequences in nature, the encoding sequences preferably are ordered in a manner 
mimicking the order of the polypeptide from which the SCHAG amino acid sequence was 

derive d. For example, the yeast Su p35 protein_has_an_aminoJerminaLSCHAG domain 

30 and a carboxy-terminal domain containing Sup35 translation termination activity. Thus, 
in embodiments of the invention where the SCHAG amino acid encoding sequence is 
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derived from a Sup35 protein, this sequence preferably is disposed upstream (5') of the 
sequence encoding the at least one polypeptide of interest. In embodiments wherein the 
fibril-aggregation amino acid encoding sequence is derived from the sequence set forth in 
Genbank Accession No. p25367 (SEQ ED NO: 29) (where the prion-Hke domain is C- 
5 terminal), this sequence is preferably disposed downstream (3') of the sequence encoding 
the at least one polypeptide of interest. In an embodiment comprising sequences 
encoding two or more polypeptides of interest, the SCHAG encoding sequence may be 
disposed between the two polypeptides of interest. 

To the extent that such sequences are not already inherent in the above- 

10 described polynucleotides, it will be understood that such polynucleotides preferably 
further comprise a translation initiation codon fused in frame and upstream (5') of the 
encoding sequences, and a translation stop codon fused in frame and downstream (3*) of 
the encoding sequences. Also, it may be desirable in some embodiments to direct a host 
cell to secrete the chimeric polypeptide. Thus, it is contemplated that the polynucleotide 

1 5 may further comprise a nucleotide sequence encoding a translation initiation codon and a 
secretory signal peptide fused in frame and upstream of the encoding sequences. 

In preferred embodiments, the polynucleotide of the invention further 
comprises additional sequences to facilitate and/or control expression in selected host 
cells. For example, the polynucleotide includes a promoter and/or an enhancer sequence 

20 operatively connected upstream (5') of the encoding sequences, to promoter expression of 
the encoding sequences in the selected host cell; and/or a polyadenylation signal sequence 
operatively connected downstream (3*) of the encoding sequences. Since concentration is 
a factor that may influence the aggregation state of encoded chimeric polypeptides, 
regulatable (e.g., inducible and repressible) promoters are highly preferred. 

25 To facilitate identification of cells that have been successfully 

transformed/transfected with the polynucleotide of the invention, the polynucleotide may 
further include a sequence encoding a selectable marker protein. The selectable marker 
may be a completely distinct open reading frame on the polynucleotide, such as an open 
reading-frame encoding-anantibiotic-resistanGe-protein-or-a-protein-that-faGihtates 

30 survival in a selective nutrient medium. The selectable marker also may itself be part of 
the chimeric pol3T3eptide of the invention. In one embodiment, a visual marker such as a 
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fluorescent protein (e.g., green fluorescent protein) is used that is distributed in the cell in 
a different manner when the protein is in the prion form than when the protein is in the 
non-prion form. In either case, cells comprising the selectable marker can be sorted, e.g., 
using techniques such as fluorescence activated cell sorting. Thus, this marker, in 
5 addition to permitting selection of transformed or transfected cells, also permits 
identification of the conformational state of the chimeric polypeptide. In another 
embodiment, the marker has two components: 1) a function that is changed when the 
protein is in a prion form and 2) a visual or selectable marker for that function. An 
example is the glucocorticoid receptor, GR and a reporter gene. GR is a transcription 

10 factor that binds to a specific DNA sequence to activate transcription. When this DNA 
sequence is fused to the coding sequence for an easily detected protein such as 
p-galactosidase or luciferase GR function can be easily assayed by the induction of the 
P-galactosidase or luciferase proteins. 

Optionally, the polynucleotide of the invention further includes an epitope 

15 tag fiised in frame with the encoding sequences, which tag is useful to facilitate detection 
in vivo or in vitro and to facilitate purification of the chimeric polypeptide or of the 
protein of interest after it has been cleaved fi-om the SCHAG amino acid sequence of the 
chimeric polypeptide. (An epitope tag alone is not considered to constitute a polypeptide 
of interest.) A variety of natural or artificial heterologous epitopes are known in the art, 

20 including artificial epitopes such as FLAG, Strep, or poly-histidine peptides. FLAG 

peptides include the sequence Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (SEQ ID NO: 5) or 
Asp-Tyr-Lys-Asp-Glu-Asp-Asp-Lys (SEQ ID NO: 6). [See generally Brewer, 
Bioprocess. TechnoL, 2: 239-266 (1991); Kunz, J. Biol. Chem., 267: 9101-9106 (1992); 
Brizzard et ai, Biotechniques 16: 13Q-135 (1 994); Schafer, Biochem. Biophys. Res. 

25 Commun., 207: 708-714 (1995).] The Strep epitope has the sequence Ala-Trp-Arg-His- 
Pro-Gln-Phe-Gly-Gly (SEQ ID NO: 7). [See Schmidt, J. Chromatography, 676: 337-345 
(1994).] Another commonly used artificial epitope is a poly-His sequence having six 
consecutive histidine residues. Commonly used naturally-occurring epitopes include the 
influenza-virus-hemagglutinin-sequenee^yr-Pro-Tyr-Asp-Val-Pro-Asp-T-^ — 

30 Gly-Arg (SEQ ED NO: 8) and truncations thereof, which is recognized by the monoclonal 
antibody 12CA5 [Murray a/.. Anal. Biochem., 229: 170-179 (1995)] and the sequence 
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(Glu-Gln-Lys-Leu-Leu-Ser-Glu-Glu-Asp-Leu-Asn) (SEQ ID NO: 9) from human c-myc, 
which is recognized by the monoclonal antibody 9E10 (Manstein et ai, Gene, 162: 129- 
134(1995)). 

In another embodiment, the polynucleotide includes 5* and 3' flanking 
5 regions that have substantial sequence homology with a region of an organism's genome. 
Such sequences facilitate introduction of the chimeric gene into the organism's genome by 
homologous recombination techniques. 

In yet another aspect, the invention provides a polynucleotide comprising a 
nucleotide sequence that encodes a chimeric polypeptide, the chimeric polypeptide 
1 0 comprising an amyloidogenic domain that causes the polypeptide to aggregate with 

polypeptides sharing an identical or nearly identical domain into ordered aggregates such 
,Q as fibrils, fused to a domain comprising a polypeptide of interest; wherein the 

amyloidogenic domain comprises an amyloidogenic amino acid sequence of a naturally 
;^ occurring protein and further includes a duplication of at least a portion of the natxirally 

uj 15 occurring amyloidogenic amino acid sequence, the duplication increasing the 
'.^ amyloidogenic affinity of the chimeric polypeptide relative to an identical chimeric 

]z: polypeptide lacking the duplication. By way of example, if the naturally occurring protein 

P comprises a Sup35 protein of Saccharomyces cerevisiae that is characterized by the 

partial amino acid sequence PQGGYQQYN (SEQ ID NO: 10), which sequence exists as 
20 multiple imperfect repeats, the duplication preferably includes the amino acid sequence 
PQGGYQQYN and/or an imperfect repeat thereof, such as a repeat wherein one or two 
residues has been added, deleted, or substituted. An exemplary sequence containing the 
NM regions of yeast Sup35, with two additional repeat segments, is set forth in SEQ ID 

NOs: 16andl7. - 

25 In a related aspect, the invention provides a polynucleotide comprising a 

nucleotide sequence that encodes a chimeric polypeptide, the chimeric polypeptide 
comprising an amyloidogenic domain that causes the polypeptide to aggregate with 
identical polypeptides into fibrils, fused to a domain comprising a polypeptide of interest; 

wherein the-amyloidogenie-domain-eomprises-amyloidogenie-amino-aeid-sequenGes-of at— 

30 least two naturally occurring amyloidogenic proteins. 
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In yet another related aspect, the invention provides a polynucleotide 
comprising a nucleotide sequence of the formula FPBT or FBPT, wherein: B comprises a 
nucleotide sequence encoding a polypeptide that is encoded by a portion of the genome of 
the cell; F and T comprise, respectively, 5' and 3' flanking sequences adjacent to the 
sequence encoding B in the genome of the cell; and P comprises a nucleotide sequence 
encoding a prion-aggregation amino acid sequence, wherein P is fused in fi-ame to B. 
Using such polynucleotides and conventional homologous recombination techniques [see, 
e.g., Ausbel et al. (1998), Volume 3, supra], one can perform homologous recombination 
in a living cell to convert a protein-encoding gene of the cell to a prion gene of the cell, as 
described in greater detail below. Altematively, strains can be constructed wherein the 
endogenous protein-encoding gene is deleted and a prion version of the gene is added 
back into the cell, either on a plasmid or by integration into the host genome. 

The homologous recombination technique is itself intended as an aspect of 
the invention. For example, the invention provides a method of modifying a living cell to 
create an inducible and stable phenotypic alteration in the cell, comprising the steps of: 
transforming a living cell with the polynucleotide described in the preceding paragraph; 
culturing the cell under conditions that permit homologous recombination between the 
polynucleotide and the genome of the cell; and selecting a cell in which the 
polynucleotide has homologously recombined with the genome to create a genomic 
sequence comprising the formula PB or BP. 

More generally, the invention provides a method of modifying a living cell 
to create an inducible and stable phenotypic alteration in the cell, such as a method 
comprising steps of: identifying a target polynucleotide sequence in the genome of the 
cell that encodes a polypeptide of interest; and transforming the cell to substitute for or 
modify the target sequence, wherein the substitution or modification produces a cell 
comprising a polynucleotide that encodes a chimeric polypeptide, wherein the chimeric 
polypeptide comprises a SCHAG amino acid sequence fused in frame with the 
polypeptide of interest. Such modifications can be performed in several ways, such as (1) 

homolGgous-reGombination-as-desGribed4n4he-preGeding-paragraphsi-(2~)-knoGkout-or 

inactivation of the target sequence followed by introduction of an exogenous chimeric 
sequence encoding the desired chimeric polypeptide; or (3) targeted introduction of a 
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SCHAG-encoding polynucleotide sequence upstream and in-frame with the target 
sequence encoding the polypeptide of interest; (4) subsequent cloning or sexual 
reproduction of such cells; and/or other techniques developed by those in the art. 

The foregoing aspects of the invention relate largely to polynucleotides. 
5 Also intended as part of the invention are vectors comprising the polynucleotides, and 
host cells comprising either the polynucleotides or comprising the vectors. Vectors are 
useful for amplifying the polynucleotides in host cells. Preferred vectors include 
expression vectors, which contain appropriate control sequences to permit expression of 
the encoded chimeric protein in a host cell that has been transformed or transfect with the 

10 vectors. Both prokaryotic and eukaryotic host cells are contemplated as aspects of the 
invention. The host cell may be from the same kingdom (prokaryotic, animal, plant, 
fiingi, protista, etc.) as the organism from which the SCHAG amino acid sequence of the 
polynucleotide was derived, or from a different kingdom, hi a preferred embodiment, the 
host cell is from the same species as the organism from which the SCHAG amino acid 

15 sequence of the polynucleotide was derived. 

fri yet another embodiment, the invention includes a host cell transformed 
or transfected with at least two polynucleotides encoding chimeric polypeptides according 
to the invention, wherein the at least two polynucleotides comprise compatible SCHAG 
amino acid sequences and distinct polypeptides of interest. Such host cells are capable of 

20 producing two chimeric polypeptides of the invention, which can be induced in vitro or in 
vivo to aggregate with each other into higher ordered aggregates. As explained in greater 
detail below, such aggregates can be advantageously employed in multi-step chemical 
reactions when the two or more polypeptides of interest each participate in a step of the 
reaction. Experiments using fluorescence resonance energy transfer (FRET) have 

25 demonstrated the efficacy of heterogeneous polypeptide aggregation into co-polymers. 

hi addition, the chimeric polypeptides encoded by any of the foregoing 
polynucleotides are intended as an aspect of the invention. Purified polypeptides are 
preferred, and are obtained using conventional polypeptide purification techniques. For 
exam ple, the invention provides a chimeric polvpeptide-Comprisingi-at-least-one-SGHAG- 

30 amino acid sequence and at least one polypeptide of interest other than a marker protein, a 
glutathione S-transferase (GST) protein, or a Staphylococcal nuclear protein. As 
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described above, the SCHAG amino acid sequence may be directly linked (via a peptide 
bond) to the polypeptide of interest, or may be indirectly linked by virtue of the inclusion 
of an intermediate spacer region, a solubility domain, an epitope to facilitate recognition 
and purification, and so on. 
5 As explained herein in detail, polypeptides of the invention are capable of 

existing in a conformation in which the polypeptide coalesces with similar polypeptides 
into ordered aggregates that may be referred to as "amyloid," "fibrils," "prions;" or 
"prion-like aggregates." Such ordered aggregates of polypeptides of the invention are 
intended as an additional aspect of the invention. Such ordered aggregates tend to be 

10 insoluble in water or under physiological conditions mimicking a host cell, and 

consequently can be purified and isolated using standard procedures, including but not 
limited to centrifiigation or filtration. In a preferred embodiment, the SCHAG amino acid 
sequence is an amino acid sequence that will self-coalesce into ordered "cross-P" fibril 
structures that are filamentous in character, in which individual P-sheet strands of 

15 component chimeric proteins are oriented perpendicular to the axis of the fibril. In a 

highly preferred embodiment, the polypeptide of interest is disposed radiating away fi-om 
the fibril core of SCHAG peptide sequences, and retains one or more characteristic 
biological activities (e.g., binding activities for polypeptides of interest that have specific 
binding partners; enzymatic activity for polypeptides of interest that are enzymes). 

20 In still another embodiment, the invention provides a composition 

comprising an ordered aggregate of at least two chimeric polypeptides of the invention, 
wherein the at least two chimeric polypeptides have compatible SCHAG amino acid 
sequences and distinct polypeptides of interest. By "compatible" SCHAG amino acid 
sequences is meant SCHAG amino acid sequences that are either identical or sufficiently 

25 similar to permit co-aggregation with each other into higher ordered aggregates. In a 
preferred embodiment, the two or more polypeptides of interest retain their native 
biological activity (e.g., binding activity; enzymatic activity) in the ordered aggregate. 
Such aggregates can be advantageously employed in multi-step chemical reactions, as 

described in detail below. ^ 

30 The invention further includes methods of making and using 

polynucleotides and polypeptides of the invention. 
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For example, the invention provides a method comprising the steps of: 
transforming or transfecting a cell with a polynucleotide of the invention; and growing the 
cell under conditions which result in expression of the chimeric polypeptide that is 
encoded by the polynucleotide in the cell. In a preferred embodiment, the method further 
5 includes the step of isolating the chimeric polypeptide from the cell or from growth 

medium of the cell. In one variation, the method further comprises the step of detaching 
the SCHAG amino acid sequence of the protein from the polypeptide of interest. As 
described above in detail, the detachment may be effected with any appropriate means, 
including chemicals, proteolj^ic enzymes, self-splicing inteins, or the like. Optionally, 

10 the method further includes the step of isolating the protein of interest from the SCHAG 
amino acid sequence. 

In a related embodiment, the invention provides a method of making a 
protein of interest, comprising the steps of: transforming or transfecting a cell with a 
polynucleotide, the polynucleotide comprising a nucleotide sequence that encodes a 

15 chimeric polypeptide, the chimeric polypeptide comprising an amyloidogenic domain that 
causes the polypeptide to aggregate with identical polypeptides into higher-ordered 
aggregates such as fibrils, fiised to domain comprising a polypeptide of interest; growing 
the cell under conditions which result in expression of the chimeric polypeptide in the cell 
and aggregation of the chimeric polypeptide into fibrils; and isolating the chimeric 

20 polypeptide from the cell or from growth medium of the cell. In a preferred embodiment, 
the isolating step comprises the step of separating the fibrils from soluble proteins of the 
cell. In a highly preferred embodiment, the method further comprises the steps of 
proteolytically detaching the amyloidogenic domain of the chimeric protein from the 
polypeptide of interest; and isolating the polypeptide of interest. Preferably the detached 

25 polypeptide of interest maintains one or more of its biological functions, e.g., enzymatic 
activity, the ability to bind to its ligand, the ability to induce the production of antibodies 
in a suitable host system, etc. 

In yet another aspect, the invention provides a method of modifying a 
living cell to c reate an inducible and stable p henotytm-alteration-in-the-cell.-Eor-example, 

30 such a method comprising the step of transforming or transfecting a living cell with a 
polynucleotide according to the invention, wherein the polynucleotide includes a 
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promoter sequence to promote expression of the encoded chimeric polypeptide in the cell, 
the promoter being inducible to promote increased expression of the chimeric polypeptide 
to a level that induces aggregation of the chimeric polypeptide into higher-ordered 
aggregates such as fibrils. In one preferred embodiment, the method further comprises 
the step of growing the cell under conditions which induce the promoter, thereby causing 
increased expression of the polypeptide and inducing aggregation of the chimeric 
polypeptide into aggregates or fibrils in the cell. In a highly preferred embodiment, the 
host cell lacks any native protein that contains the same SCHAG amino acid sequence 
that might co-aggregate with the chimeric polypeptide. For example, the SCHAG amino 
acid sequence comprises an amino terminal domain of a Sup35 protein, and the host cell 
is a yeast cell that comprises a mutant Sup35 gene that expresses a Sup35 protein lacking 
an amino terminal domain capable of prion aggregation. In such host cells, the chimeric 
polypeptide can be expressed at a high level and induced to aggregate without 
concomitant precipitation of the host cell's Sup35 protein into the aggregates, which could 
be detrimental to host cell viability. 

In yet another aspect, the invention provides methods for reverting the 
phenotype obtained according to the method described in the preceding paragraph. One 
such method comprises the step of overexpressing a chaperone protein in the cell to 
convert the polypeptide fi-om a fibril-forming conformation into a soluble conformation. 
In a preferred embodiment, the chaperone protein comprises the Hspl04 protein of yeast, 
or a related HsplOO-type protein fi-om another species. Examples include the ClpB 
protein of E. coli and the AtlOl protein of Arabidopsis, [See generally Schirmer et al.. 
Trends in Biochemistry, 21: 289-296 (1996), incorporated herein by reference.] The over- 
expression is achieved, e.g., by placing the gene encoding the chaperone.protein under the 
control of an inducible promoter and inducing the promoter. 

Another such method for reverting the phenotype comprises the step of 
contacting the cell with a chemical denaturant at a concentration effective to convert the 
polypeptide from a fibril-forming conformation to a soluble conformation. Exemplary 

denaturants include guanidine HCl (preferab ly about 0.1 to 100 m M.-more-preferablv-l 

1 0 mM) and urea. In another variation, the cell is subjected to heat or osmotic shock for a 
period of time effective to convert the polypeptide's conformation. Both over-expression 
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of Hspl04 and growth on guanidine-HCl containing medium have proven effective for 
inducing phenotypic reversion of chimeric NM-GR prion constructs described in the 
Examples herein. 

In yet another aspect, the invention provides materials and methods for 
5 identifying novel SCHAG amino acid sequences. One such method comprises the steps 
of joining a candidate nucleotide sequence "X" to a nucleotide sequence encoding the 
carboxyl terminal domain of a Sup35 protein (CSup35), especially a yeast Sup35 protein, 
to create a chimeric polynucleotide of the formula 5'-XCSup35-3' or 5'-CSup35X-3'; 
transforming or transfecting a host cell with the chimeric polynucleotide; growing the 

10 host cell under conditions in which the host cell loses its native Sup35 gene, such that the 
chimeric polynucleotide becomes the only polynucleotide encoding CSup35; growing the 
resultant host cell under conditions selective for a nonsense suppressive phenotype; and 
selecting a host cell displaying the nonsense suppressive phenotype, wherein growth in 
the selective conditions is correlated with the candidate nucleotide sequence X encoding a 

15 SCHAG amino acid sequence. Additional methods steps and alternative methods are 

described in detail below in the Examples. In one variation, the Csup35 is substituted by 
a different protein domain for which selection on the basis of inactivation is possible. 

Many of the foregoing aspects of the invention relate, at least in part, to 
embodiments that involve chimeric polynucleotides and polypeptides, wherein properties 

20 of SCHAG amino acid sequences are advantageously employed through attaching them to 
other sequences using recombinant molecular biological techniques. In another variation 
of the invention, the advantageous properties of SCHAG amino acid sequences are 
exploited by making SCHAG sequences with sites that are modifiable using organic 
chemistry or enzymatic techniques. 

25 For example, in one embodiment, the invention provides a method of 
making a readable SCHAG amino acid sequence comprising the steps of identifying a 
SCHAG amino acid sequence, wherein polypeptides comprising the SCHAG amino acid 
sequence are capable of forming ordered aggregates; analyzing the SCHAG amino acid 
sequence to identify at leas t one amino acid residue in the_sequenceJiaving-a-side-Ghain 

30 exposed to the environment in an ordered aggregate of polypeptides that comprise the 
SCHAG amino acid sequence; and modifying the SCHAG amino acid sequence by 
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substituting an amino acid containing a reactive side chain for the amino acid identified as 
having a side chain exposed to the environment in an ordered aggregate of polypeptides 
that comprise the SCHAG amino acid sequence. By "reactive" side chain is meant an 
amino acid with a charged or polar side chain that can be used as a target for chemical 
5 modification using conventional organic chemistry procedures, preferably procedures that 
can be performed in an environment that will not permanently denature the protein. In 
preferred embodiments, the amino acid containing a reactive side chain is cysteine, lysine, 
tyrosine, glutamate, aspartate, and arginine. The identifying step entails any selection of a 
SCHAG amino acid sequence. For example, the identifying can simply entail selecting 

10 one of the SCHAG amino acid sequences described in detail herein; or can entail 
screening of genomes, proteins, or phenotypes of organisms to identify SCHAG 
sequences (e.g., using methodologies described herein); or can entail de novo design of 
SCHAG sequences based on the properties described herein. 

Proteins comprising the SCHAG sequence are capable of coalescing into 

15 higher-ordered aggregates. The polypeptides of such aggregates have amino acids that are 
disposed internally (in close proximity only to other amino acids in the aggregate), and 
other amino acids whose side chains are exposed to the environment of the aggregate such 
that they contact molecules in the envirormient. In the method, the analyzing step entails 
a prediction or a determination of at least one amino acid within the SCHAG sequence 

20 that is exposed to the environment of an aggregate of the proteins, meaning that it is an 
amino acid that will likely contact chemical reagents that mixed with the aggregates. 
Amino acids in a SCHAG amino acid sequence having side chains exposed to the 
environment in ordered aggregates of polypeptides comprising the SCHAG amino acid 
sequence can be identified experimentally, for example, by structural analysis of mutants 

25 constructed using site-directed mutagenesis, e.g., high throughput cysteine scarming 

mutagenesis, as described in detail below in the Examples. Alternatively, specific amino 
acids in a SCHAG amino acid sequence can be predicted to have side chains that are 
exposed to the environment in ordered aggregates of polypeptides comprising the 
SCHAG a mino acid s equen ce based on st ructuraLstudies.or-Computer-modeHng-of-the 

30 SCHAG amino acid sequence. The step of modifying the amino acid sequence entails 
changing the identity of an amino acid within the sequence. For the purposes of such a 
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method, the act of inserting a reactive amino acid within the amino acid sequence, at a 
position essentially adjacent to the position of the identified amino acid, is considered the 
equivalent of substituting that amino acid for the identified amino acid. In other words, 
for the purposes of making a reactable SCHAG amino acid sequence, the term 
5 "substituting" should be understood to include inserting an amino acid within the amino 
acid sequence, at a position essentially adjacent to the position of the identified amino 
acid. 

It is contemplated that some naturally-occurring SCHAG amino acid 
sequences will fortuitously include one or more reactive amino acids whose side chains 

10 are exposed to the environment in polypeptide aggregates. Use of such naturally 

occurring SCHAG reactive amino acids is contemplated as an additional aspect of the 
invention. Moreover, modification of naturally occurring SCHAG amino acid sequences 
that contain an undesirable number of reactive amino acids to eliminate one or more 
reactive amino acids is contemplated. 

15 In a preferred embodiment, the method fiirther comprises a step of making 

a polypeptide comprising the reactable SCHAG amino acid sequence. Substitution of 
such amino acids with amino acid residues containing reactive side chains can be carried 
out in the laboratory by, e.g., site-directed mutagenesis of a SCHAG-encoding 
polynucleotide or by peptide synthesis of the SCHAG amino acid sequence. In another 

20 preferred embodiment, the invention additionally comprises the step of making a polymer 
comprising an ordered aggregate of polypeptide monomers wherein at least one of the 
polj^Deptide monomers comprises a reactable SCHAG amino acid sequence. For 
example, polypeptide monomers comprising the reactable SCHAG amino acid sequence 
are seeded with an aggregate or otherwise subjected to an environment favorable to the 

25 formation of an ordered aggregate or "polymer" of the polypeptide monomers. In yet 

another preferred embodiment, the invention further comprises the step of contacting the 
reactive side chains with a chemical agent to attach a substituent to the reactive side 
chains. The substituent itself may be a linker molecule to facilitate attachment of one or 
m ore additional molecules. The substituent m ay be-attached-using-a-chemical agent. 

30 Attachment of a substituent depends on the nature of the substituent, as well as the 
identity of the reactive side chain, and can be accomplished by conventional organic 
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chemistry procedures. Exemplary procedures for modifying the sulfhydryl group of a 
cysteine residue that has been introduced into a SCHAG amino acid sequence are 
described in greater detail below in the Examples. In preferred embodiments, the 
substituent is an enzyme, a metal atom, an affinity binding molecule having a specific 
5 affinity binding partner, a carbohydrate, a fluorescent dye, a chromatic dye, an antibody, a 
growth factor, a hormone, a cell adhesion molecule, a toxin, a detoxicant, a catalyst, or a 
light-harvesting or light altering substituent. In a preferred embodiment, the reactive 
amino acid that has been introduced into the SCHAG sequence will be substantially 
absent from the rest or the SCHAG amino acid sequence, or at least substantially absent 
1 0 from those portions of the sequence that are exposed to the environment in ordered 

aggregates of the polypeptide. This absence may be a natural feature, or may be the result 
of an additional modification step to substitute or delete other occurrences of the amino 
acid. Designing the reactable SCHAG amino acid sequence in this manner permits 
controlled chemical modification at the reactive sites that have been designed into the 
1 5 sequence, without modification of other residues. 

In yet another embodiment of the invention, the invention further 
comprises the steps of contacting the polypeptides comprising the reactive side chains 
with a chemical agent to attach a substitutent to the reactive side chains, thereby 
providing modified polypeptides, and making a polymer comprising an ordered aggregate 
;i 20 of polypeptide monomers, wherein at least some of the polypeptide monomers comprise 
the modified polypeptides. Exemplary procedures for making a polymer comprising an 
ordered aggregate of modified polypeptide monomers are described in greater detail 
below in the Examples. 

In yet another embodiment, the invention provides a method of making a 
25 reactable SCHAG amino acid sequence, wherein the SCHAG amino acid sequence is 
modified to contain exactly one, two, three, four, or some other specifically desired 
number of the reactive amino acids. An exemplary method comprises the steps of (a) 
identifying a SCHAG amino acid sequence, wherein polypeptides comprising the 
SCHAG amino acid sequence are capable of forming ordered aggregates; (b) analyzmg 



30 the SCHAG amino acid sequence to identify at least one amino acid residue in the 
sequence having a side chain exposed to the environment in an ordered aggregate of 
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polypeptides that comprise the SCHAG amino acid sequence; (c) modifying the SCHAG 
amino acid sequence by substituting an amino acid containing a reactive side chain for the 
amino acid identified as having a side chain exposed to the environment in an ordered 
aggregate of polypeptides that comprise the SCHAG amino acid sequence; (d) analyzing 
5 the SCHAG amino acid sequence to identify at least a second amino acid residue in the 
sequence having an amino acid side chain that is exposed to the environment in an 
ordered aggregate of polypeptides that comprise the SCHAG amino acid sequence; and 
(e) modifying the SCHAG amino acid sequence by substituting an amino acid containing 
a reactive side chain for at least one amino acid identified according to step (d), wherein 

10 the amino acid substituted in steps (c) and (d) differ, thereby making a reactable SCHAG 
amino acid sequence with at least two selectively reactable sites. This method can be 
further elaborated to create SCHAG amino acids sequences with more than two 
selectively reactable sites. By introducing two or more different reactive amino acids, a 
SCHAG sequence is created with two or more sites that can be separately 

1 5 reacted/modified. It will be appreciated that the method also can be performed to 

introduce the same reactive amino acid for each identified amino acid, to create two or 
more identical reactive sites in the SCHAG sequence. 

hi another embodiment of the invention, the invention provides 
polypeptides comprising a SCHAG amino acid sequence that has been modified by 

20 substituting at least one amino acid that is exposed to the environment in an ordered 

aggregate of the polypeptides with an amino acid containing a reactive side chain, as well 
as polynucleotides that encode the polypeptides, ha a fiirther embodiment, a substituent is 
attached to the reactive amino acid of the modified polypeptide of the invention or 
reactable SCHAG sequence, hi a highly preferred ernbodiment, the SCHAG amino acid 

25 sequence is modified to contain exactly one, two, three, four, or some other specifically 
desired number of the reactive amino acids, thereby providing a SCHAG amino acid 
sequence which is modifiable at controlled, stoichiometric levels and positions. To 
achieve this goal, modifications to remove undesirable, native reactive amino acids from 
a n aturally occurring SCHAG sequence are contemplated. Pol ype ptides com prising a 

30 naturally occurring SCHAG amino acid sequence characterized by one or more reactive 
amino acids, that have been modified by substituting or eliminating a natural reactive 
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amino acid, are considered a further aspect of the invention, as are polynucleotides that 
encode the polypeptides. 

The invention also provides polymers or fibers of ordered aggregates 
comprising polypeptide subunits wherein at least one of the polypeptide subunits 
5 comprises a reactable SCHAG amino acid sequence. By the term "fibril" or "fiber" is 
meant a filamentous structure composed of higher ordered aggregates. By " polymer" is 
meant a highly ordered aggregate that may or may not be filamentous, hi another 
embodiment, the polymer or fiber is modified or substituted by attaching a substituent to 
the reactable SCHAG amino acid sequence of the polypeptide subunits. Also 

10 contemplated are polymers or fibers that comprise more than one type of substituent by 
attachment of different substituents to the reactable SCHAG amino acid sequence of the 
polypeptide subunits of the polymer or fiber. Attachment of the substituents to the 
reactive side chains contained in the reactable SCHAG amino acid sequence can occur 
either before or after coalescing of the polypeptides comprising the reactable SCHAG 

15 amino acid sequences into polymers comprising ordered aggregates of the polypeptides. 
Modification by attachment of specific substituents to such polymers or fibers can confer 
distinct fiinctions to these molecules. Thus, polymers or fibers, wherein one or more 
discrete regions of the polymer or fiber are modified to enable a distinct function are 
contemplated. In another variation, different regions of a polymer or fiber are 

20 differentially modified to confer different fiinctions. Also contemplated are polymers or 
fibers containing patterns of attachments, and consequently patterns of fionctionalities. 
The invention also provides polymers comprising fibers wherein at least one fiber has a 
distinct fiinction different fi*om that of another fiber in the polymer. Fibers comprising 
polypeptides subunits that are capable of emitting light or altering the wavelength of the 

25 light emitted in response to binding of a ligand to the fiber can be used as highly sensitive 
biosensors. Polymers comprising fibers wherein some of the fibers comprise polypeptide 
subunits capable of absorbing light of one wavelength and emitting light of second 
wavelength, and other fibers comprising polypeptide subunits capable of absorbing the 
light emitted by the first set of fibers and emitt ingJjghLof a,differentJvvavelength-are-alsQ- 

30 contemplated. 
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In one preferred embodiment, the polymer or fiber is long and thin and 
contains no or few branches, except at positions defined by deliberate introduction of sites 
for interaction between the polypeptide subunits. Polymers or fibers in which the 
polypeptide subunits have been modified to enable directed interactions between the 
5 polypeptide subunits within a single polymer or fiber, or between two discrete polymers 
or fibers are contemplated. Polymers of fibers that have been modified to enable 
interactions to occur between separate polymers of fibers can be used to create a 
meshwork of polymers of fibers. In one variation, the meshwork can be generated 
reversibly by using interactions dependent on sulfhydryl groups present on the 

10 polypeptide subunits of the polymer of fiber. Such meshworks can be usefial, for example, 
for filtration purposes. In another preferred embodiment, a fibril, ordered aggregate, 
polymer or fiber is attached to a solid support. For example, binding of a polymer of fiber 
to a solid support can be mediated by biotin-avidin interactions, wherein the biotin is 
attached to the polymers or fibers and avidin is bound to the solid support or vice versa, 

15 In a related embodiment, the invention provides a method of making a 

polymer or fiber with a predetermined quantity of reactive sites for chemically modifying 
the polymer of fiber, comprising the steps of providing a first polypeptide comprising a 
first SCHAG amino acid sequence that is capable of forming ordered aggregates with 
polypeptides identical to the first polypeptide; providing a second polypeptide comprising 

20 a second SCHAG amino acid sequence that is capable of forming ordered aggregates with 
polypeptides identical to the first polypeptide or the second polypeptide, wherein the 
second SCHAG amino acid sequence includes at least one amino acid residue having a 
reactive amino acid side chain that is exposed to the environment and serves as a reactive 
site in ordered aggregates of the second polypeptide and; mixing the first and second 

25 polypeptides under conditions favorable to aggregation of the polypeptides into ordered 
aggregates, wherein the polypeptides are mixed in quantities or ratios selected to provide 
a predetermined quantity of second polypeptide reactive sites. In a preferred embodiment, 
the invention further comprises the step of reacting the reactive side chains to attach a 
substituent to the reactive amino acid side chains of the polymer of fiber. Alternative ly, 

30 the step of reacting the reactive side chains to attach a substituent to the reactive amino 
acid side chains is performed prior to mixing of the polypeptides comprising reactable 
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SCHAG amino acid sequences to from ordered aggregates. In yet another embodiment, 
the invention provides a method of making a polymer or fiber comprising a first 
polypeptide comprising a first SCHAG amino acid sequence and a second polypeptide 
comprising a second SCHAG amino acid sequence, wherein both the first and second 
5 SCHAG amino acid sequence includes at least one amino acid residue having a reactive 
amino acid side chain that is exposed to the environment and serves as a reactive site, and 
wherein the reactive amino acid side chains of the first and second SCHAG amino acid 
sequences that are exposed to the environment in ordered aggregates are not identical, 
thereby permitting selective reaction of the reactive amino acid side chain of the first 

10 SCHAG amino acid sequence without reacting the reactive amino acid side chain of the 
second SCHAG amino acid sequence. 

In another embodiment, the invention provides a method of making a 
polymer comprising two or more regions with distinct function comprising the steps of (a) 
providing a first polypeptide comprising a SCHAG amino acid sequence and a first 

15 functional domain and a second polypeptide comprising a SCHAG amino acid domain 
and a second functional domain that differs fi*om the first fimctional domain, wherein the 
SCHAG amino acid sequences of the polypeptides are capable of forming ordered 
aggregates with polypeptides identical to the first or second polypeptide; (b) aggregating 
the first polypeptide by subjecting a composition comprising the first polypeptide to 

20 conditions favorable to aggregation of the first polypeptide into ordered aggregates, 

thereby forming a polymer comprising a region containing polypeptides that include the 
first functional domain; and (c) mixing a composition comprising the second polypeptide 
with the polymer formed according to step (b), under conditions favorable to aggregation 
of the second polypeptide with the polymer of step (b), thereby forming a polymer 

25 comprising the first region containing polypeptides that include the first functional 

domain and a second region containing polypeptides that include the second fijnctional 
domain. In one preferred embodiment, the SCHAG amino acid sequences of the first and 
second polypeptides are identical. In another preferred embodiment, at least one of the 
first and second functional domains comprises an amino acid that compnses_aj;eactive__ 

30 amino acid side chain. In yet another preferred embodiment, at least one of the first and 
second functional domains comprises an amino acid sequence of a polypeptide of interest. 
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In another variation, the method further comprises the step of mixing a composition 
comprising the first polypeptide with the polymer formed according to step (c), under 
conditions favorable to aggregation of the first polypeptide with the polymer of step (c), 
thereby forming a polymer comprising the first region containing polypeptides that 
5 include the first functional domain, the second region containing polypeptides that 
include the second functional domain, and a third region containing polypeptides that 
include the first functional domain. Alternatively, the invention provides a method of 
making a polymer comprising two or more regions with distinct function wherein the 
method further comprises the steps of providing a third polypeptide that comprises a 

10 SCHAG amino acid sequence and a third funcfional domain that differs from the first and 
second functional domains, wherein the SCHAG amino acid sequence of the third 
polypeptide is capable of forming ordered aggregates with polypeptides identical to the 
first polypeptide or the second polypeptide; and mixing a composition comprising the 
third polypeptide with the polymer formed according to step (c), under conditions 

15 favorable to aggregation of the third polypeptide with the polymer of step (c), thereby 
forming a polymer comprising the first region containing polypeptides that include the 
first functional domain, the second region containing polypeptides that include the second 
functional domain, and a third region containing polypeptides that include the third 
functional domain. 

20 In still another variation, the invention provides various living cells with 

two or more customized, reversible phenotypes. For example, the invention provides a 
living cell that comprises: (a) a first polynucleotide comprising a nucleotide sequence 
encoding a polypeptide that comprises a prion aggregation domain and a domain having 
transcription or translation modulating activity, wherein the hving cell is capable of 

25 existing in a first stable phenotypic state characterized by the polypeptide existing in an 
unaggregated state and exerting a transcription or translation modulating activity and a 
second phenotypic state characterized by the polypeptide existing in an aggregated state 
and exerting altered transcription or translation modulating activity; and (b) an exogenous 
polynucleotide comprising a nucleotide sequence that encodes a polypeptjde^f i nterest, 

30 vv^th the proviso that the sequence encoding the polypepfide of interest includes a 
regulatory sequence causing differential expression of the polypeptide in the first 
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phenotypic state compared to the second phenotypic state. Exemplary prion aggregation 
domains are described with respect to Sup35, Rnql, and Ure2. The first polynucleotide 
may itself be an endogenous (native) polynucleotide of the cell, such as the native yeast 
Sup35 sequence in a yeast cell, which comprises a prion aggregation domain fused to a 
5 translation termination factor sequence. Alternatively, the first polynucleotide may be 
introduced into the cell (or a parent cell) using genetic engineering techniques. The term 
"exogenous polynucleotide" is meant to encompass any polynucleotide sequence that 
differs fi"om a naturally occurring sequence in the cell as a result of human genetic 
manipulation. For example, an exogenous sequence may constitute an expression 

10 construct that has been introduced into a cell, such as a construct that contains a promoter, 
a foreign polypeptide-encoding sequence, a stop codon, and a polyadenylation signal 
sequence. Alternatively, an exogenous sequence may constitute an endogenous 
polypeptide-encoding sequence that has been modified only by the introduction of a 
promoter, an enhancer, or other regulatory sequence that is not naturally associated with 

15 the polypeptide-encoding sequence. Introduction of a regulatory sequence that is 

influenced by the aggregation state of the polypeptide encoded by the first polynucleotide 
is specifically contemplated. In one preferred variation, the cell fiirther comprises a 
nucleotide sequence that encodes a polypeptide that modulates the expression level or 
conformational state of the polypeptide that comprises the prion aggregation domain. 

20 Such a polynucleotide facilitates manipulation of the cell to switch phenotypes. 
Polynucleotides encoding chaperone proteins that influence prion protein folding 
represent one example of this latter category of polynucleotide. In one specific variation, 
the invention provides a living cell according to claim 97, wherein the first polynucleotide 
comprises a nucleotide sequence encoding a polypeptide that comprises a prion 

25 aggregation domain fused in-fi-ame to a nucleotide sequence encoding a translation 

termination factor polypeptide; and wherein the regulatory sequence comprises a stop 
codon that interrupts translation of the polypeptide of interest. 

In another variation, the invention provides a living cell comprising: (a) a 
polynucleotide com prising a nucleotide sequence e ncoding,a,poIvpeptide-that-cQmprises-a- 

30 prion aggregation domain fused in-frame to a nucleotide sequence encoding a translation 
termination factor polypeptide; and (b) an exogenous polynucleotide comprising a 
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nucleotide sequence that encodes a polypeptide of interest, with the proviso that the 
sequence encoding the polypeptide of interest includes at least one stop codon that 
interrupts translation of the polypeptide of interest; wherein the living cell is capable of 
existing in a first stable phenotypic state characterized by translational fidelity and 
5 substantial absence of synthesis of the polypeptide of interest and a second phenotypic 
state characterized by aggregation of the translation termination factor, reduced 
translational fidelity, and expression of the pol>peptide of interest. 

Additional features and variations of the invention will be apparent to 
those skilled in the art fi"om the entirety of this application, including the drawing and 

10 detailed description, and all such features are intended as aspects of the invention. 

Likewise, features of the invention described herein can be re-combined into additional 
embodiments that also are intended as aspects of the invention, irrespective of whether 
the combination of features is specifically mentioned above as an aspect or embodiment 
of the invention. Also, only such limitations which are described herein as critical to the 

15 invention should be viewed as such; variations of the invention lacking limitations which 
have not been described herein as critical are intended as aspects of the invention. 

In addition to the foregoing, the invention includes, as an additional aspect, 
all embodiments of the invention narrower in scope in any way than the variations 
specifically mentioned above. Although the applicant(s) invented the full scope of the 

20 claims appended hereto, the claims appended hereto are not intended to encompass within 
their scope the prior art work of others. Therefore, in the event that statutory prior art 
within the scope of a claim is brought to the attention of the applicants by a Patent Office 
or other entity or individual, the applicant(s) reserve the right to exercise amendment 
rights under applicable patent laws to redefine the subject matter of such a claim to 

25 specifically exclude such statutory prior art or obvious variations of statutory prior art 
fi-om the scope of such a claim. Variations of the invention defined by such amended 
claims also are intended as aspects of the invention. 

BRIEF DESCRI PTlQN-0£XHE-DRA-WrNG 

Figure 1 depicts the DNA and deduced amino acid sequences (SEQ ED 
30 NOs: 50-51) of an NMSup35-GR chimeric gene described in Example 1. 



Figure 2 depicts a map of an integration plasmid described in Example 2 
which contains a chimeric gene comprising the amino-terminal domain of yeast Ure2 
protein, a hemagglutinin tag sequence, and the carboxyl-terminal domain of yeast Sup35 
protein. 

Figure 3 depicts the nucleotide sequence (SEQ ID NO: 49) of the plasmid 
of Figure 2. As shown in Figure 2, the NUre2-CSup35 chimeric gene is encoded on the 
strand complementary to the strand whose sequence is depicted in Figure 3. 

Figure 4 schematically depicts that the structure of wild-type (WT) yeast 
Sup35 protein (Top), which contains an amino-terminal region characterized by five 
imperfect short repeats, a highly charged middle (M) region, and a carboxyl-terminal 
region involved in translation termination during protein synthesis; a Sup35 mutant 
designated RA2-5, characterized by deletion of four of the repeat sequences in the N 
region; and a Sup35 mutant designated R2E2 (bottom), into which two additional copies 
of the second repeat segment have been engineered into the N region. Also depicted is 
the frequency with which yeast strains carrying these various Sup35 constructs were 
observed to spontaneously convert from a [psi-] to a [PSI+] phenotype. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention expands the study of prion biology beyond the 
contexts where it has heretofore focused, namely fundamental research directed to 
developing a greater understanding of prion biology and medical research directed to 
developing diagnostic and therapeutic materials and methods for prion-associated disease 
states, and provides diverse and practical applications that advantageously employ certain 
unique properties of prions, including one or more of the following: 

(1) prion genes and proteins afford the possibility of two stable, heritable 
phenotypes and the ability to effect at least one switch between such phenotypes; 

(2) prions provide the ability to sequester a protein or protein-binding 
molecule into an ordered aggregate; 

(3) prion-protein-aggregates are easily isolated from-cells containing them; 

with at least some prions, the ordered aggregate is fibrillar in structure, stable and 



unreactive, a collection of properties that is exploited in certain embodiments of 
the invention; 

(4) a protein of interest that is fused to a prion protein can potentially retain its 
normal biological activity even when the fusion has formed an ordered prion 
aggregate; and 

(5) a protein of interest that is fused to a prion protein can switch from an 
active to an inactive state, and this change is reversible. 

Prion proteins have been observed to exist in at least two stable 
conformations in cells that synthesize them. For example, the PrP protein in mammals 
has been observed in a soluble PrP*^ conformation in "normal" cells and in an aggregated, 
insoluble PrP^"" conformation in animals afflicted with transmissible spongiform 
encephalopathies. Similarly, the Sup35 protein in yeast has been observed in a "normal" 
non-aggregated conformation in which it forms a component of a translation termination 
factor, and also aggregated into fibril structures in [PSI^] yeast cells (characterized by 
suppression of normal translation termination activity). To the extent that scientific 
literature has ascribed any practical importance to these observations, the importance has 
focused on identifying materials and methods to modulate conformational switching, 
which might lead to treatments for prion-mediated diseases; or to detect the infectious 
Pj.pSc fyjj^ protect the food supply; or to diagnose infection and prevent its spread. At 
least in the case of the yeast Sup35 prion, the [PSr] phenotype can be eliminated by 
effecting an over-expression or under-expression of the heat shock protein Hspl04, and 
can be induced by effecting an over-expression of Sup35 or the Sup35 amino-terminal 
prion-aggregation domain. 

The practical applications that arise from the ability to alter the phenotype 
of a cells or an entire organism by transforming/transfecting cells with a polynucleotide 
that encodes a non-native protein (and/or that integrates into the cell's genome to cause 
production of a non-native protein) are legion and underlie a major portion of the entire 
biotechnology industry. Such applications include medical/therapeutic applications {e,g., 

gene therapy-to-treat-genetie disorders-sueh-as-hemophilia^gene therapy-to treat 

pathological conditions such as ischemia, inborn errors of metabolism, restenosis, or 
cancer); pharmacological applications (e.g., recombinant production of therapeutic 
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polypeptides such as erythropoietin, human growth hormone, angiogenic and anti- 
angiogenic peptides, or cytokines for therapeutic administration); industrial appHcations 
{e.g., genetic engineering of microorganisms for bioremediation or frost prevention; or 
recombinant production of catalytic enzymes, vitamins, proteins, or other organic 
5 molecules for use in chemical and food processing); and agricultural applications (e.g., 
genetic engineering of plants and livestock to promote disease resistance, faster growth, 
better nutritional value, environmental durability, and other desirable properties); just to 
name a few. In such biotechnology applications, a cell typically is 
transformed/transfected with a single novel gene to introduce a single phenotypic 

10 alteration that persists as long as the gene is present. Means of controlling the new 

phenotype conventionally involve eliminating the new gene, or possibly placing the gene 
imder the control of inducible or repressible promoter to control the level of gene 
expression. The present invention provides the realization that prion genes and proteins 
afford an additional, alternative means of biological control, because the introduction of a 

15 prion sequence into a protein introduces the possibility of two stable, heritable phenotypes 
and the ability to effect at least one switch between such phenotypes. Specifically, one 
can phenotypically alter a cell to produce a protein of interest by transforming/transfecting 
a cell with a gene encoding a prion-aggregation domain fiised to a protein of interest. To 
reduce or eliminate the activity of this protein, one induces the protein to undergo a 

20 conformational alteration and adopt a prion-like aggregating phenotype, thereby 
sequestering the protein. To re-introduce the original recombinant phenotype, one 
induces the protein to undergo a conformational alteration and adopt the soluble 
phenotype. 

By way of example, the phenotypic alteration potential of prion-like 
25 proteins can be harnessed to permit a species (plant, animal, microorganisms, fungi, etc.) 
to survive in a wider range of environmental conditions and/or quickly adopt to 
environmental changes. Species that thrive in one environment often have difficulty in 
another. For example, some photosynthetic organisms grow well under bright light 

because they_produce-pigments-that-protect the-organism-from-potentially-toxic-effects of 

30 bright light, whereas others grow well under low light conditions because of other Hght- 
gathering pigment systems that efficiently harvest all available light. By placing the 
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regulators for such systems under a prion control mechanism, prion conformational 
switching is advantageously harnessed for increased environmental adaptability. 

A preferred prion system for harnessing environmental adaptation is a 
prion system such as the Sup35 or Ure2 yeast prions that undergo natural switching. In 
5 these systems, the yeast prion state and phenotype arises naturally (in a non-prion 

population) at a frequency of about one per milHon cells, and is lost at a similar frequency 
in a prion population. Thus, in any yeast culture of reasonable size, both phenotypes will 
be present. If the prion state imparts a growth advantage under some conditions and the 
non-prion state imparts a growth advantage under other conditions, the culture as a whole 

10 will survive and thrive under either set of conditions. Although one phenotype may be 
disfavored and selected against, it will nonetheless be present (due to natural switching 
behavior of the prion) and ready to "take over" the culture if conditions change to favor 
it. In this regard, also contemplated as an aspect of the invention is a cell culture 
comprising cells transformed or transfected with a polynucleotide according to the 

1 5 invention, wherein the cells express the chimeric polypeptide encoded by the 
polynucleotide, and wherein the cell culture includes cells wherein the chimeric 
polypeptide is present in an aggregated state and cells free of aggregated chimeric 
polypeptide. 

The prion-mediated flexibility described in the preceding paragraph 
20 possesses a crucial advantage over traditional "switches" because it does not depend upon 
fortuitous genetic mutations and reversions. Each phenotype arises from the same 
genotype and each is available within the population, even under selective conditions. 
Thus, in a cultured photosynthetic organism as described above, transformation with one 
or more genes encoding an aggregating domain fused to pigment or protective proteins 
25 will provide an increased adaptability to varying light conditions. 

This "natural switching" quality of prions has applicability to a wide 
variety of variable growth conditions that might be encountered by cultured cells or 
organisms, including varied levels of salinity, metals, carbon sources, and toxic metabolic 

by products. AdaplabiIitv_tO-Such-environments-is-often-mediated-by"Qne-or-a-few 

30 proteins, such as metal-binding proteins and enzymes involved in the synthesis or 

breakdown of particular organic compounds. The advantages of prion natural switching 
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are considered particularly well suited for fields of bioremediation, where multiple 
environmental conditions are expected to be encountered, and fermentation processes 
where nutrients are consumed and fermentation by products are created, changing an 
environment over time. 

5 By way of another example, pigment genes for flowers, textile fibers (e.g., 

cotton), or animal fibers (e.g., wool) are placed under the control of prion-Hke aggregating 
elements. A plurality of colors and/or color patterns is achieved in a single plant by 
altering growing conditions to induce or cure the prion regulated pigment, or by 
subjecting portions of the plant to chemical agents that modulate conformation of the 
10 prion protein. 

The present invention also provides practical applications stemming fi*om 
the realization that prions provide the ability to sequester a protein of interest or the 
protein's binding partner into an ordered aggregate. This property is demonstrated herein 
by way of example involving the prion aggregation domain of the yeast Sup35 gene fused 
15 to a glucocorticoid receptor. When cells expressing this fusion are in a non-prion 
phenotype (i.e., the fusion protein is soluble), the cells are susceptible to hormonal 
induction through the glucocorticoid receptor, and one can induce the expression of a 
second gene that is operably fijsed to a glucocorticoid response element. However, when 
cells expressing the fusion are in a prion phenotype (i.e., the fusion protein is forming 
20 aggregates), the susceptibility to hormonal induction is reduced, because the 

glucocorticoid receptor that is sequested into cytoplasmic aggregates is unable to effect its 
normal activity in the cell's nucleus. 

This ability to a sequester protein or protein-binding partner has direct 
application in the recombinant production .of biological molecules, especially where 
25 recombinant production is difficult using conventional techniques, e.g., because the 

molecule of interest appears to exert a toxic or growth-altering effect on the recombinant 
host cell. Such effects can be reduced, and production of the polypeptide of interest 
enhanced, by expressing the polypeptide of interest as fusion with a prion aggregation 
domain i n a host cell tha t h_as^^oris,iiiduc.edJo,have,-a-prion-aR^egatiQn-phenQt 
30 such host cells, the recombinant fusion protein forms ordered aggregates through its prion 
aggregation domain, thereby sequestering the protein of interest as part of the aggregate. 



and reducing its adverse effects on other cellular components or reactions. (If the 
molecule of interest is the binding partner of the non-prion domain of the fusion protein, 
the binding partner also will be sequestered by the aggregate, provided that the binding 
activity of this domain is retained in the aggregate.) 
5 The present inventors also provide practical applications stemming from 

the fact that prion aggregates can be readily isolated from cells containing them. Because 
prions form insoluble aggregates in appropriate host cells, it is relatively easy to separate 
aggregated prion protein from most other proteinaceous and non-proteinaceous matter of 
a host cell, which is comparatively more soluble, using centrifugation techniques. When 
10 the prion protein is fused to a protein of interest, the protein of interest can likewise be 
separated from most other host cell impurities by centrifiigation techniques. Thus, the 
present invention provides materials and methods useftil for the purification of virtually 
any recombinant protein of interest. If a recognition sequence for chemical or enzymatic 
cleavage is included between the prion aggregation domain and the protein of interest, the 
15 protein of interest can be cleaved and separated from the insoluble prion aggregate in a 
second purification step. Such protein production techniques are considered an aspect of 
the invention. For example, the invention provides a method comprising the steps of: 
expressing a chimeric gene in a host cell, the chimeric gene comprising a nucleotide 
sequence encoding a SCHAG amino acid sequence ftised in frame to a nucleotide 
20 sequence encoding a protein of interest; subjecting the host cell, or a lysate thereof, or a 
growth medium thereof to conditions wherein the chimeric protein encoded by the 
chimeric gene aggregates; and isolating the aggregates. In one variation, the method 
further includes the step of cleaving the protein of interest from the SCHAG amino acid 
sequence and isolating the protein of interest. - - 

25 Moreover, the improved purification techniques are not limited to proteins 

fused to a prion domain. For example, a host cell expressing a prion aggregation domain 
fused to a protein of interest can be used in a like manner to purify a binding partner of 
the protein of interest. For example, if the protein of interest is a growth factor receptor, 
it can be used to se questerJhe,grow-thJactorJtself— bv-virtue-Qf-the-reeeptor's-affini^t^^ — 
30 the growth factor. In this way, the growth factor can be similarly purified, even though it 
is not itself expressed as a prion fusion protein. If the protein of interest comprises an 
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antigen binding domain of an antibody, then the same techniques can be used to sequester 
and purify virtually any antigen (protein or non-protein) that is produced by the host cell 
or introduced into the host cell's environment. In this regard, it is well-known in the 
literature that relatively short variable (V) regions within antibodies are largely 
5 responsible for highly specific antigen-antibody immunoreactivity, and such 

antigen-binding regions occur within particular regions of an antibody's primary structure 
and are susceptible to isolation and cloning. (See, e.g., Morrison and Oi, Adv. Immunol., 
44:65-92 (1989). For example, the variable domains of antibodies may be cloned from 
the genomic DNA of a B-cell hybridoma or from cDNA generated from mRNA isolated 

10 from a hybridoma of interest. Likewise, it is known in the art how to isolate only those 
portions of the variable region gene fragments that encode antigen-binding 
complementarity determining regions ("CDR") of an antibody, and clone them into a 
different polypeptide backbone. [See, e.g., Jones et al. Nature, 327:522-525 (1986); 
Riechmann et aL, Nature, 552:323-327 (1988); Verhoeyen et al. Science, 259:1534-36 

15 (1988); and Tempest et al, Bio/Technology, P:266-71 (1991).] A polypeptide comprising 
an antigen binding domain of an antibody of interest might comprise only one or more 
CDR regions from an antibody, or one or more V regions from an antibody, or might 
comprise entire V region fragments linked to constant domains from the same or a 
different antibody, or might comprise V regions that have been cloned into a larger, 

20 non-antibody polypeptide in a way that preserves their antigen binding characteristics, or 
might comprise antibody fragments containing V regions, and so on. Also, it is known in 
the art to select and isolate polypeptides comprising antigen binding domains of 
antibodies using techniques such as phage display that obviate the need to immunize 
animals and work with native antibodies at all. 

25 The present invention also provides practical applications stemming from 
the fact that at least some proteins of interest will retain their normal biological activity 
when expressed as a fiision with a prion aggregation domain, even when the fusion 
protein forms prion-like aggregates. This feature of the invention is demonstrated by way 
of example below using the S. cerevisiae S u p35 prion a ggregation-domain-fused-to-a 

30 green fluorescent protein (GFP). Even in [PST] cells or in other cells where aggregation 
of the fusion protein into fibrils has occurred, the GFP fluoresces green under blue light. 
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indicating that the GFP portion of the fusion has retained a biologically active 
conformation. 

When the example is repeated substituting a protein of interest for the GFP 
marker protein, ordered aggregates comprising a biologically active protein of interest are 
5 produced, hi a preferred embodiment, the protein of interest is a protein that is capable of 
binding a composition of interest. For example, the protein of interest comprises an 
antigen binding domain of an antibody that specifically binds an antigen of interest; or it 
comprises a ligand binding domain of a receptor that binds a ligand of interest. Fibrils 
comprising such fusion proteins can be used as affinity matrices for purifying the 
10 composition of interest. Thus, aggregates of a chimeric protein comprising a SCHAG 
amino acid sequence fused to an amino acid sequence encoding a binding domain of a 
protein having a specific binding partner are intended as an aspect of the invention. 

In another preferred embodiment, the polypeptide of interest is an enzyme, 
especially an enzyme considered to be of catalytic value in a chemical process. Fibrils 
1 5 comprising such fusion proteins can be used as a catalytic matrix for carrying out the 
chemical process. Thus, aggregates of a chimeric protein comprising a SCHAG amino 
acid sequence fused to an enzyme are intended as an aspect of the invention. 

In another preferred embodiment, ordered aggregates are created 
comprising two or more enzymes, such as a first enzyme that catalyzes one step of a 
20 chemical process and a second enzyme that catalyzes a dow^nstream step involving a 
"metabolic" product from the first enzymatic reaction. Such aggregates will generally 
increase the speed and/or efficiency of the chemical process due to the proximity of the 
first reaction products and the second catalyst enzyme. Aggregates comprising two or 
more proteins of interest can be produced in multiple ways, each of which is itself 
25 considered an aspect of the invention. 

It may be advantageous to attach fibers to a solid support such as a bead 
(e.g., a Sepharose bead) or a surface to create a "chip" containing loci with biological or 
chemical function. 

In one variation, each chimeric^ protein com prising-an-aggregation-domain— 
30 and a protein of interest is produced in a separate and distinct host cell system and 

recovered (purified and isolated). The proteins are either recovered in soluble form or are 
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solubilized. (Complete purification is desirable but not essential for subsequent 
aggregation/polymerization.) Thereafter, a desired mixture of the two or more proteins is 
created and induced into polymerization, e.g., by ''seeding" with a protein aggregate, by 
concentrating the mixture to increase molarity of the proteins, or by altering salinity, 
acidity, or other factors. The desired mixture may be 1 :1 or may be at a ratio weighted in 
favor of one chimeric protein {e.g., weighted in favor of an enzyme that catalyzes a slower 
step in a chemical process). The different chimeric proteins co-polymerize with the seed 
and with each other because they comprise compatible aggregation (SCHAG) domains, 
and most preferably identical aggregation domains, hi certain embodiments it may be 
desirable to include in the pre-aggregation mixture a polypeptide comprising the SCHAG 
domain only, without an attached enzyme, for the purpose of increasing the average space 
between individual enzyme molecules in the aggregate that is formed. The additional 
space may be desirable, for example, if the enzyme's substrate is a large molecule. 

In another variation, the two distinct host cell systems are co-cultured, and 
the chimeric transgenes include signal peptides to induce the cells to secrete the chimeric 
proteins into the common culture medium. The proteins can be co-purified from the 
medium or induced to aggregate without prior purification. 

In still another variation, the transgenes for two or more recombinant 
chimeric polypeptides are co-transfected into the same host cell, either on a single 
polynucleotide construct or multiple constructs. Such a host cell produces both 
recombinant polypeptides, which can be induced to polymerize in vivo in a prion 
phenotype host, or can be recovered in soluble form and induced to polymerize in vitro. 
The present invention also exploits the fact that at least certain prion proteins form 
aggregates that are fiber-like in shape; strong; and resistant to destruction by heat and 
many chemical environments. This collection of properties has tremendous industrial 
application that heretofore has not been exploited. Thus, in one embodiment, the 
invention provides polypeptides comprising SCHAG amino acid sequences which have 
been modified to comprise a discrete number of reactive sites at discrete locations. The 

polypeptides can be recombinant ly produced and purified and aggregated-into-robust 

fibers resistant to destruction. The reactive sites permit modification of the polypeptides 
(or the fibers comprising the polypeptides) by attachment of virtually any chemical entity. 
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such as pigments, light-gathering and hght-emitting molecules for use as sensors, 
indicators, or energy harnessing and transduction; enzymes; metal atoms; organic and 
inorganic catalysts; and molecules possessing a selective binding affinity for other 
molecules. Electrical fields may be applied to fibers that are labeled with metal atoms, so 
5 that the fibers can be oriented in a specific direction. Because the fiber monomers are 

protein, conventional genetic engineering techniques can be used to introduce any number 
of desired reactive sites at precise locations, and the precise location of the reactive sites 
can be studied using conventional protein computer modeling as well as experimental 
techniques. Proteins and fibers of this type enjoy the utilities of the chimeric proteins 

10 described above (e.g., as chemical purification matrices, chemical reaction matrices, etc.) 
and additional utility due to the ability to bind a potentially infinite variety of non-protein 
molecules of interest to the reactive sites. The fibers can be grown or attached to solid 
supports to create devices comprising the fibers. 

These and other aspects of the invention will be better understood by 

1 5 reference to the following examples. The examples are not intended to limit the scope of 
the invention, and variations will be apparent to the reader fi-om the entirety of this 
document. 

Example 1 

Construction and assaying of a chimeric, 
20 prion-like gene and protein with yeast Sup35 protein 

The following experiments were performed to demonstrate that a prion- 

determining domain of a prion-like protein can be fiised to a polypeptide from a wholly 

different protein to construct a novel, chimeric gene and protein having prion-like 

properties. The relevance of these experiments to the present invention also is explained. 

25 A. Construction of a NMSup35-GR chimeric gene 

The yeast {Saccharomyces cerevisiae) Sup35 protein (SEQ ED NO: 2, 685 
amino acids, Genbank Accession No. M21129) possesses the prion-like capacity to 
undergo a self-perpetuating conformational alteration that changes the functional state of 
Sup35 in a manner that creates a hmlablexhange-in-phenot>pe7T;?rperiments have 
^30 demonstrated that it is the amino-terminal (N region, amino acids 1-123 of SEQ ED NO: 
2) or the amino-terminal plus middle (M, amino acids 124-253 of SEQ ID NO: 2) regions 
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of Sup35 that are responsible for this prion-like capacity. See Glover et ai. Cell, 89: 811- 
819 (1997); see also King et al, Proc. Natl Acad. ScL USA, 94:6618-6622 (1997) (N- 
terminal polypeptide fragment consisting of residues 2-114 of Sup35 spontaneously 
aggregates to form thin filaments in vitro.). The M domain is highly charged and 
5 therefore acts to maintain the protein in solution. This property causes the aggregation 
process to proceed more slowly, providing beneficial control to the system. 

A chimeric polynucleotide Fig. 1 and (SEQ ID NO: 50) was constructed 
comprising a nucleotide sequence encoding the N and M domains of Sup35 (Fig. 1 and 
SEQ ED NO: 50, bases 1 to 759) fused in-frame to a nucleotide sequence (derived from a 
10 cDNA) encoding the rat glucocorticoid receptor (GR) (Genbank Accession No. M14053, 
Fig. 1 and SEQ ID NO: 50, bases 766-3150), a hormone-responsive transcription factor, 
;S followed by a stop codon. This construct was inserted into the pRS3 1 6CG (ATCC 

Accession No. 77145, Genbank No. U03442) and pGl (Guthrie & Sink, "Guide to Yeast 
M Genetics and Molecular Biology" in Methods of Enzymology, Vol. 194, pp. 389-398 

E^i 15 (1981)) plasmids under the control of either the CUPl promoter (plasmid pCUPl- 
= NMGR, inducible by adding copper to the growth medium) or the constitutive GPD 

O promoter (plasmid pGDP-NMGR). The nucleotide sequences of CUPl and GDP 

(Genbank Accession No. Ml 3807) promoters are set forth in SEQ ID NOs: 1 1 and 48, 
't;! respectively. The GR coding sequence without NM, in the same promoter and vector 

13 20 constructs (plasmids pCUPl-GR and pGDP-GR), served as a control. GR activity in 

transformed yeast was monitored with two reporter constructs containing a glucocorticoid 
response promoter element (GRE) [Schena & Yamamoto, Science, 2^7:965-967 (1988)] 
fused to either a p-galactosidase (Swiss-Prot. Accession No. P00722) or to a firefly 
luciferase (Genbank Accession No. M l 5077) coding sequence. When GR is activated by 
25 hormone, e.g. , deoxycorticosterone (DOC), it normally binds to the GRE and promotes 
transcription of the reporter enzyme in either mammals or yeast. See M. Schena and K. 
Yamamoto, Science 2^7/965-967 (1988). 



30 



Construction of a NMSUP35dGEP-chimeric-gene 



A chimeric gene comprising the NM region of Sup35 fused to a green 
fluorescent protein (GFP) sequence and under the control of the CUPl promoter was 
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constructed essentially as described in Patino et ai. Science, 273: 622-626 (1996) 
(construct NPD-GFP), incorporated by reference herein. (The use of GFPs as reporter 
molecules is reviewed in Kain et ciL, Biotechniques, 79:650-655 (1995); and Cubitt et aL, 
Trends Biochem. ScL, 20:448-455 (1995), incorporated by reference herein.) The 
resulting construct encodes the NH2-terminal 253 residues of Sup35 (SEQ ID NO: 2) 
fused in-frame to GFP. The NM-Sup35-GFP encoding sequence was amplified by PGR 
and cloned into plasmid pCLUC [D. Thiele, MoL Cell, Biol., 8: 745 (1988)], which 
contains the CUPl promoter for copper-inducible expression. A similar construct was 
created substituting the constitutive GDP promoter for the CUPl promoter. An identical 
GFP construct lacking the NM fusion also was created. 

C. Transformation and phenotvpic analysis of Ivsi-] and \PSr] yeast 
1. Constructs Regulated by the CUPl promoter 

The GR and NM-GR constructs regulated by the CUPl promoter on a low 
copy plasmid (ura selection) were transformed into [psi-] and [PSr] yeast cells (strain 
74D) along with a 2 |l (high copy number) plasmid containing a GR-regulated p- 
galactosidase reporter gene with leucine selection. Transformants were selected by sc- 
leu-ura and used to inoculate sc.-leu-ura medium. Cultures were grown overnight at 
SC^C, and induced by adding copper sulfate to the medium to a final 0-250 |aM copper 
concentration. 

After 4 to 24 hours of induction, both proteins were expressed at a similar 
level in [psi-] cells, and both the GR and NM-GR transformed [psi-] cells produced 
similar levels of reporter enzyme activity in response to hormone (DOC added to a final 
concentration of 10 [IM at the time of copper sulfate induction). Virtually no reporter 
enzyme activity was detected without hormone. The fact that both GR and NM-GR 
constructs resulted in similar levels of activity indicates that the NM fusion does not 
intrinsically alter the ability of GR to function in hormone-activated transcription, 
demonstrating the utility of the NM domain as a fusion protein tag. 

In contrastrwhen-the same constructs-were transformed-into-yeast cells 

that contain the heritable, conformationally-altered form of Sup35 [PST], GR activity was 
reduced in cells expressing the NM-GR fusion construct, compared to cells expressing 
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GR. Thus, pre-existing prions (which comprise self-coalescing aggregates of NM- 
containing Sup35 protein) can interact with NM-GR. Similar results were obtained with 
NM-Green Fluorescent Protein (GFP) constructs: NM-GFP interacted with pre-existing 
[PSr] elements, but GFP alone did not. 
5 An important difference existed between the NM-GR and NM-GFP 

studies in the [PSr] cells, however. Unlike the NM-GR fusion, the NM-GFP fusion 
retained similar GFP activity with the [PSr] prion, i.e., the NM-GFP fusion still glowed 
green. This difference in activity is explained by the facts that, for biological activity, GR 
needs to be in the nucleus, bind to DNA, and interact in specific ways with other elements 
10 of the transcription machinery. When NM-GR is sequestered in [PSr] cells by 

interacting (aggregating) with the Sup35 prion filaments, the GR function is diminished. 

-1; 2. Constructs regulated by the constitutive GPP promoter on a high 

J^; copv plasmid. 

Ui A set of experiments demonstrated that plasmids that cause expression of 

- ^' 15 NM at a high level can be successfully transformed into [psi-] yeast cells, but not into 
Q [PSr] cells. Apparently, over-expressed NM causes excessive prion-like aggregation of 

13 endogenous Sup35 in cells that are already [PSf], eliminating so much translation 

J™ termination factor function that the yeast cells cannot survive. 

Q When a high copy plasmid vector comprising the NM-GR open reading 

20 frame under the control of the constitutive GPD promoter was used to transform [psi-] or 
[PSr] yeast, no [PSr] transformants were obtained, whereas [psi-] transformants were 
readily obtained. The control GR construct in the same vector and under control of the 
same promoter transformed equally weirinto both and |/?5z-]^^^c 

When amino acids 22-69 in the N domain of Sup35 are deleted, the 
25 resultant protein fails to form ordered aggregates, and yeast comprising this Sup35 variant 
fail to adopt a [PSF] phenotype. When these same amino acids were deleted from the 
high copy number NM-GR plasmid, the inability to transform [PSI"] cells was eliminated: 

transformants-were-obtained-as-readiiy-in-[-PSI-]-as-[psi-]-eells: 

Both NM-GR and GR [psi-] transformants were used to inoculate sc.-leu- 
30 trp medium, and the cultures were grown at 30''C overnight, diluted into fresh medium to 
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achieve a cell density of 2 - 4 x 10^ cells/ml, induced with DOC ( 1 0 |-tM final 
concentration), and grown for an additional period varying from 1 hour to overnight. 
Analysis of marker gene activity in the transformed [psi-] cells demonstrated that 
hormone responsive transcription was lower in NM-GR transformants than in GR 
5 transformants. Western blotting using an anti-GR monoclonal antibody (Affinity 

Bioreagents Inc., MAI -5 10) was used to examine the levels of NMGR and GR expression 
in these cells. Although cells carrying the NM-GR fusion had lower levels of GR activity, 
the NM-GR protein was actually expressed at a much higher level than the GR protein 
without the NM domain. Thus, the reduced levels of hormone-activated transcriptional 
10 activity were not due to an effect of NM on the accumulation of the transcription factor, 

^ but to an alteration in GR activity in the NM-GR-expressing cells. This reduced activity 

suggested that NM-GR is capable of undergoing a de novo, prion-like alteration in 

%1 function when it is expressed at a sufficiently high level. 

H To confirm that NM-GR was forming prions de novo in the transformed 

w 1 ^ [ps^-] cells into which it had been introduced, such cells were induced with copper to 

express NM-GR and then were plated onto copper- free media lacking adenine, and 
D therefor selective for the [PSr] element/phenotype. See Chemoff et aL, Science, 268: 

a 880 (1995), and Cox et al.. Yeast, 4(3): 159-178 (1988). A substantial fraction of the 

cells were able to grow on medium selective for [PSt], suggesting that the highly 
13 20 expressed NM-GR was responsible for the formation of new prions putatively containing 
both NM-GR and Sup35 protein. Moreover, the number of colonies obtained varied with 
the level of copper induction prior to plating. This change in the growth properties of the 
cells was observed to be heritable and was maintained even under conditions where the 
NM-GR plasmid construct was lost by the host cells, indicating that NM-GR had induced 
25 the formation of a new Sup3 5 -containing prion. 



D. Analvsis of NMGR-induced phenotvpe in cells carrying a deletion 
of the NM region of Sup35 . 

To fu rther confirm that NM-GR was truly functionin .^ as an independent, 

novel prion, experiments were conducted to determine whether an NM-GR prion was 

30 formed independently of both the yeast [PSt^ element and the endogenous Sup35 protein. 
Specifically, the GPD-regulated GR and NM-GR constructs were co-transformed with 
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plasmid p5275 (containing GRE linked to a firefly luciferase reporter gene) into a yeast 
strain ( ANMSUP35) carrying a deletion of the NM region of the SUP35 gene. Three 
independent transformants of each construct (GR or NM-GR) were examined. Colonies 
were picked and grown overnight in SC selective media (-trp, -ura) at 30*^0. Thereafter, 
5 deoxycorticosterone (DOC) was added to the growth medium to a final concentration of 
10 l-lM. Luciferase activity was assayed in intact cells after 25 hours of DOC induction. 

All three transformants expressing the NM-GR protein showed lower 
levels of GR activity (specific activities of about 4, 5, 4) than the three transformants 
expressing GR without the NM fusion (specific activities of about 23, 28, and 39). The 

10 differences in GR activity was observed after 1 hour of hormone induction and appeared 
to increase after 5.5 or after 25 hours of induction. 

Westem blotting was conducted to determine whether the differences in 
activity were the result of differences in protein concentration. Ethanol lysates were 
prepared fi-om 3 ml yeast cultures expressing GR or NMGR twenty-five hours after the 

15 addition of DOC. About 50 |J,g total protein was analyzed by SDS/PAGE and 

immuoblot. The protein gel was transferred onto PVDF membranes and probed with a 
monoclonal antibody against GR (Bu-GR2, Affinity Bioreagents, Golden Colorado). The 
same membrane was later stained with Coomassie blue to semiquantitatively evaluate 
total protein. The Westem studies again showed that the levels of NM-GR were higher 

20 than the levels of GR alone. 

E. Effect of Guanidine Hydrochloride and Hspl04 on NM-GR prions . 

When the yeast having [URE3] or [PST*^] pheno types are passaged on 
medium containing low concentrations of guanidine hydrochloride (GdHCl), their prion 
determinants change ("cure") at a high fi^equency fi-om the aggregated, inactive prion state 
25 into the active, unaggregated state, and such changes are heritable. These phenotypes also 
can be cured by over-expression of the chaperone Hspl04. 

Another series of experiments were conducted to assay for such curative 

behavior"in"yeast"harboring-an-NM-GR construct^The naturaTGR-protein contains a 

ligand-binding domain and hormone must be added to the medium to determine whether 
30 or not the protein is active. For this series of experiments, the hormone-binding domain 
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was removed from the NM-GR construct, creating an NM-GR fusion that was 
constitutively active. 

Yeast expressing the NM-GR chimeric construct and a glucocorticoid 
response element fused to a P-galactosidase marker exhibited different levels of prion- 
5 like behavior, manifested by different colony colors. In addition to white colonies 

(indicative of a prion-like state lacking p-gal induction) and blue colonies (indicative of 
soluble NM-GR and high levels of p-gal induction), medium blue and pale blue colonies 
also were observed. (Westem blotting indicated that differently colored colonies 
contained comparable amounts of GR protein.) These differently colored colonies were 

10 replica-plated onto plates containing 5 mM GdHCl and then subsequently replica-plated 
again onto X-Gal indicator plates. In control cells expressing vector alone (no NM-GR 
insert), white colonies remained white. However, all of the NM-GR-expressing colonies 
produced blue colonies. The efficiency of curing varied with the NM-GR strain: medium 
blue colonies produced almost entirely blue colonies, whereas pale blue colonies 

15 produced a mixture of blue and white colonies. 

To determine if the heritable loss of NM-GR activity is susceptible to 
Hspl04 curing, white colonies of cells expressing NM-GR were transformed with a GDP- 
HSP104 over-expression plasmid and streaked onto X-Gal indicator plates. Control cells 
transformed with empty vector remained white. In contrast, white cells transformed with 

20 the Hspl04 over-expression construct changed to blue. The blue cells remained blue 
upon-restreaking, indicating that transient over-expression of Hspl04 was sufficient to 
cure cells of the heritable reduction of NM-GR activity. 

When the same NM-GR constructs were used to transform yeast 
containing a deletion mutation of Hspl 04, white colonies were never produced. This 

25 finding is consistent with the observation that Hspl 04 mutations are incompatible with 
the maintenance of the [PSI^] phenotype. 

Together, the foregoing data indicate that the difference in GR activity 
observed when NM-GR is expressed at a high constitutive level is due to a heritable 

alteration in-GR-functionrrather than to an-alteration-in'GR'expression: 

30 Collectively, the foregoing experiments demonstrate that the amino- 

terminal domain of a prion-like yeast gene, Siip35, can be fused to a polypeptide from a 
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wholly different protein to construct a novel, chimeric gene and protein having prion-Uke 
properties. Significantly, these results are believed to be the first demonstration that a 
SCHAG protein domain can be fused to a non-native protein domain to form a chimera, 
expressed in a host cell that fails to express the native SCHAG protein, and still behave in 
5 a prion-like marmer. (Specifically, these results demonstrate that the NM domains of 

SUP35 will behave Hke a prion even w^hen the C-terminal domain of the protein is not the 
native Sup35 C-terminus, and even v^hen the host cell does not express an endogenous 
Sup35 protein containing an NM region.) The experiments also define exemplary assays 
for screening other putative prion-like peptides for their ability to confer a prion-like 
10 phenotype. (It will be apparent that the use of markers other than GFP, OR, luciferase, or 
P'galactosidase would work in such assays. The GFP marker is useful insofar as it 
provides an effective marker for localizing a fusion protein in vivo. The GR marker is 
additionally useful insofar as GR activity depends on GR localization in the nucleus, 
DNA binding, and interaction with transcription machinery; whereas GFP is active in the 
1 5 cytoplasm.) Exemplary prion-like peptides for screening in this manner are peptides 
identified according to assays described below in Example 5; mammalian PrP peptides 
responsible for prion- forming activity; and other known fibril-forming peptide sequences, 
such as human amyloid P (1-42) peptide. 
Si In addition, the experiments demonstrate an improved procedure for 

20 recombinant production of certain proteins that might otherwise be difficult to 

recombinantly produce, e.g.^ due to the protein's detrimental effect on the growth or 
phenotype of the host cell. For example, DNA binding and DNA modifying enzymes that 
might locate to a cell's nucleus and detrimentally effect a host cell may be expressed as a 
fusion with a SCHAG amino acid sequence fi*om a prion-like protein. In host cells 
25 wherein the aggregate- forming phenotype is present, the recombinant protein is 

"sequestered" into higher order aggregates. By virtue of this sequestration, the biological 
activity of the resultant protein in the nucleus is reduced. The fusion protein is purified 
from the insoluble fraction of host cell lysates, and can be cleaved from the fibril core if 

an-appropriate~endopeptidase recognition-sequence-has'been-included-in-the-fusion 

30 construct between the SCHAG amino acid sequence and the sequence of the protein of 

interest. (An appropriate endopeptidase recognition sequence is any recognition sequence 
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that is not present in the protein of interest, such that the endopeptidase will cleave the 
protein of interest from the fibril structure without also cleaving within the protein of 
interest.) 

Example 2 

Construction and assaying of a chimeric, 
prion-like gene and protein with yeast Ure2 protein 

The following experiments were performed to demonstrate that the prion- 

determining domain of yeast Ure2 protein also can be fused to a polypeptide other than 

the Ure2 functional domain to construct a novel, chimeric gene and protein having some 

prion-like properties. Two prion-like elements are known in yeast: [PSr] and [URE3], 

The underlying proteins, Sup35 and Ure2, each contain an amino-terminal domain (the N 

domain) that is not essential for normal function but is crucial for prion formation. The N 

domains of both Sup35 and Ure2 are unusually rich in the polar amino acids asparagine 

and glut amine. 

A. Construction of a NUre2-CSup35 chimeric gene 

A chimeric polynucleotide (Fig. 3, SEQ ED NO: 49) was constructed 
comprising a nucleotide sequence encoding the N domain of yeast {Saccharomyces 
cerevisiae) Ure2 protein (Genbank Accession No. M35268, SEQ ID NO: 3, bases 182 to 
376, encoding amino acids 1 to 65 (SEQ ID NO: 4) of Ure2 (NUre2)), fused in- frame to a 
nucleotide sequence encoding a hemagglutinin tag (SEQ ED NO: 13, TAG GGA TAG 
GAG GTG GGA GAG TAG GGT), fused in-frame to a nucleotide sequence encoding the 
G domain of yeast Sup35 (GSup35) protein that is responsible for translation-regulation 
activity of Sup35 (Genbank Accession No. M21 129, SEQ ID NO: 1, bases 1498-2793, 
encoding amino acids 254 to 685 of Sup35 (SEQ ID NO: 2)). At the 5' and 3' ends of 
this construct were 5' and 3' flanking regions, respectively, of the yeast Sup35 genomic 
DNA. This construct was inserted into the pRS306 plasmid (available from the ATGG, 

Manassas, Virginia, USA, Accession No. 77141 ; see also Genbank Accession No. 

U03438) as shown in Figures 2 and 3, and used to transform yeast as described below. 
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B. Transformation and phenotvpic analysis of yeast 

To replace the Sup35 gene with the NUre2-CSup35 chimeric gene, the 
first step was to integrate the gene fragment into the yeast genome. Freshly grown cells 
from overnight culture were collected and resuspended in 0.5 ml LiAc-PEG-TE solution 
5 (40% PEG4000, lOOmM Tris-HCL, pH7.5., 1 mM EDTA) in a 1.5 ml tube. 100 jlg/lO 
[Xl carrier DNA (salmon testis DNA, boiled 10 minutes and chilled immediately on ice) 
and 1 \Xg/2 \ll of transforming plasmid DNA were added and mixed. This transformation 
mixture was incubated overnight at room temperature and then heat shocked at 42°C for 
1 5 minutes. 1 00 |-Ll of transformation mixture were then spread onto a uracil dropout 
10 plate. After transformation, selection for Ura+ results in an integration event, such that 
native and chimeric genes bracket the URA3-containing plasmid sequence. 
^ Transformants were picked and cells having the integrated chimeric gene were confirmed 

iil by genomic PGR and Westem blot. 

M. The second step of the replacement involved the excision or "popping out" 

15 of the wildtype Sup35 gene through homologous recombination between the native Sup35 
^ ^ and the chimeric sequence. Popout of the plasmid was monitored by screening for 

Q colonies that are ura- and therefore resistant to the drug 5-fluoroorotic acid (5-FOA). 

li: Cells with NUre2-CSup35 integrated were thus plated onto 5-FOA medium to select for 

■y those that have the plasmid sequence containing one copy of the Sup35 gene popped out. 

p 20 Clones in which the native Sup35 gene had been replaced with the chimeric gene were 
then screened by means of colony PGR and further confirmed by Westem blot. 

To screen for yeast strains that have gene integration and replacement, a 
Ure2 coding sequence N- terminal primer and a Sup35 coding sequence primer were used 
for PGR reactions. The NUre2-CSup35 DNA fragment can only be amplified from 
25 genomic DNA of cells containing the chimeric gene. To confirm that only the fusion 
protein of NUre2-CSup35 was expressed in those cells that have the gene replacement, 
yeast cells were lysed and the cell lysates were run on SDS-polyacrylamide gel and 
proteins were transferred to PVDF immunoblot. Since there is a hemagglutinin (HA) tag 

inserted-between4vfUre2-and-GSup35~Westeni-blots were then probed-with^ 

30 antibody from Boehringer Mannheim. To confirm that NUre2-GSup35 is the only copy 
of Sup35 gene in yeast genome, Westem blots were also probed with an antibody against 



the middle region of Sup35 protein. Loss of antibody signal verified that the NM region 
of Sup35 gene had been replaced with the N-terminus of Ure2. Thus, the transformed 
cells were characterized by a deleted native Sup35 gene that had been replaced by the 
NUre2-CSup35 chimeric gene. 

Transformed colonies carrying the chimeric NUre2-CSup35 gene of 
interest were grown on rich medium (YPD) at 30'^C. The resuhant colonies were streaked 
onto fPST^J selective medium (SD-ADE) and incubated at 30°C to determine whether 
some or all contained a [PSr] phenotype. Two different types of colonies were observed. 
Some showed normal translational termination characteristic of a [psi-] phenotype. 
Others showed the suppressor phenotype characteristic of [PSI^] cells. Both phenotypes 
were very stable and were inherited from generation to generation of the transformed 
yeast cells. 

To determine whether the observed difference in translational fidelity was 
due to a heritable change in protein conformation, cells were lysed and the lysates 
subjected to centriftigation at 12,000 or 100,000 x g for 10 minutes. Supematants and 
precipitate fractions were screened for the fusion protein using an anti-HA antibody 
(HA- 11, Covance Research Products Inc.). The cells that showed reduced translational 
fidelity also showed aggregation of the NUre2-CSup35 ftision protein, whereas the fusion 
protein did not appear aggregated in cells having normal translation termination 
characteristics. 

The foregoing experiments demonstrate that the amino-terminal domain of 
another prion-like yeast gene, Ure2, can be fiised to a polypeptide derived from a wholly 
different protein to construct a novel, chimeric gene and protein having prion-like 
properties. These results represent the first such demonstration of this kind. [Compare 
Maison & Wickner, Science, 270: 93 (1995) (Ure2,.65/p-gal fusion did not change the 
activity of the P-galactosidase enzyme) and Paushkin et ai, EMBO J., 15(12): 3127-3134 
(1996) (GST-NSup35 chimeric construct did not allow native Sup35 to adopt an altered 
state.)] 

Several-factors are-suggested-for-aehieving-prion-like-behavior with 

chimeric genes that comprise SCHAG sequences. First, it is preferable to include the 
SCHAG sequence at a location in the chimeric gene {e.g., amino-terminus or carboxy- 
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terminus) that corresponds to the location at which it is found in its native gene. For 
example, if NSup35 is selected as the SCHAG sequence, then the chimeric gene 
preferably is constructed with NSup35 at the amino-terminus, preceding the sequence 
encoding the polypeptide of interest. Second, it is preferable to include a spacer region 
of, e,g,, at least 5,10, 20, 30, 40, or 50 amino acids, and preferably at least 60, 70, 80, 90, 
100, 120, 130, 140, or 150 amino acids, to separate the SCHAG domain from other 
domains and reduce the likelihood of steric hinderance caused by other domains. The 
length of spacer apparently can be quite large because a chimeric construct comprising 
whole Sup35 fused to Green Fluorescence Protein appears to act as a prion in preliminary 
experiments. Third, it is preferable if the protein of interest is a protein that does not 
itself naturally form multimers, because multimer formation of the protein of interest is 
apt to cause steric interference with the ordered aggregation of the SCHAG domain. 
(Maison & Wickner's research involved P-galactosidase, which forms a tetrameric 
functional unit.) The experiments also demonstrate an altemative assay system 
CSup35 fusions) to the GFP and GR assay systems described in the preceding example to 
screen peptide sequences for their ability to confer prion-like phenotypic properties. 

Also contemplated are fusion proteins comprising the M domain of Sup35, 
or portions of fragments thereof, fused to a different protein to generate a novel protein 
with prion-like activities. Likewise, fusion proteins displaying prion-like properties, 
comprising portions or fragments of the N domain, or comprising portions or fragments 
of the N and of the M domain are also contemplated. 

Example 3 

Modulation of propensity of protein to form prion-like aggregates 

The following experiments demonstrate that the propensity of novel 
chimeric proteins to aggregate into prion-like fibrils can be modulated by varying the 
number of oligopeptide repeats in the SCHAG portion of the chimeric protein. An 
increased propensity to form such fibrils is useful in instances where the fibrils 
themselves-Gomprise-a-desirable-end-produet-to-be-harvested-frGmcellsr^.g'.vvia-lysis-and- 
centrifugation; and in instances where fibril formation in vivo is desired to phenotypically 
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alter a cell, e.g., by sequestering a biologically active molecule in the cell away from the 
molecule's normal subcellular region of biological activity. 

The yeast Sup35 protein contains an oligopeptide repeat sequence 
(PQGGYQQYN, SEQ ID NO: 2, residues 75 to 83; with imperfect repeats at residues 41 
5 to 50; 56 to 64; 65 to 74; and 84 to 93). The following experiments demonstrated that an 
expansion of this oligopeptide repeat in the NM region of Sup35 increases the rate of 
appearance of new, heritable, [PSTJ-like elements, whereas decreasing the number of 
repeats lessened the rate of appearance of such elements. 

Three expression vectors were created for the experiment containing a 

10 chimeric gene comprising a CUPl promoter sequence (SEQ ID NO: 11) operably linked 
to a sequence encoding a Sup35 NM region, fused in- frame with a "superglow" GFP 
encoding sequence (SEQ ID NO: 39). In the first construct (Ra2-5), the Sup35 NM 
region had been modified by deleting four of the five oligopeptide repeats found in the 
native N region (SEQ ID NOs: 14 & 15). In the second construct (R2E2), the Sup35 NM 

1 5 region had been modified by twice expanding the second oligopeptide repeat found in the 
native N region, creating a total of seven oligopeptide repeats (SEQ ID NOs: 16 & 17). In 
the third construct, the native Sup35 NM region was employed (SEQ ED NO: 1, 
nucleotides 739 to 1506, encoding residues 1 to 256 of SEQ ED NO: 2). The CUPl 
promoter permitted control of the expression of the chimeric proteins by manipulation of 

20 copper ion concentration in the growth medium. [See Thiele, D.J., Mol. Cell Biol, 8: 

2745-2752 (1988).] The attachment of GFP to NM permitted visualization of the mutant 
proteins in living cells. 

Each of the three above-described NM-GFP constructs were introduced via 
homologous recombination at the site of the wild-type Sup35 gene into [psi-] yeast cells 

25 canying a nonsense mutation in the ADEl gene (strain 74-D694 [psi-]), and monitored 
for the fi*equency at which cells converted to a [PSt^ phenotype. Cell cultures in the log 
phase of growth at 30 °C were induced to express the GFP-fusion proteins by adding 
CUSO4 to the cultures cells to a final concentration of 50 |-LM. For analysis via 

fLuoresc.encejTiicroscopy,xells-were-fixed-with4-%-f^ 

30 twenty hours of culture. For analysis of [PST^] induction, cells over-expressing the GFP 
fusion proteins were serially diluted and spotted onto YPD and SD-ADE media after four 
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hours and twenty hours. Conversion was measured by the abihty of cells to grow on 
medium without adenine (SD-ADE). The [PSr] phenotype causes readthrough of 
nonsense mutations, producing sufficient protein to suppress the ADEl mutation and 
allow growth without adenine. 
5 Cells were induced with copper for 4 hours to promote expression of the 

chimeric gene and serially diluted, and then aliquots of each dilution were plated on 
SD-ADE, conditions that allowed loss of the plasmid. To demonstrate that the initial 
cultures contained similar numbers of cells, serial dilutions from each culture also were 
plated on rich medium (YPD) which allowed the growth of all cells in the culture. After 

10 incubating the plates for 48 hours at 30°C, colonies on each plate were counted. 

Cells expressing the oligopeptide repeat expansion mutation converted to 
[PSr'] at a much higher frequency than cells expressing the native Sup35NM-GFP, which 
in turn converted to [PSr] at a higher frequency than cells expressing the oligopeptide 
repeat deletion mutation. The observed conversion results were specifically attributable 

15 to the production of the chimeric proteins, because the conversion to [PSr] did not occur 
in cells that were not induced with copper (control). 

In a related experiment, the repeat expansion and repeat deletion mutations 
were introduced into a full-length Sup35 protein-encoding sequence to create constructs 
encoding the NM(R2E2) and NM(RA2-5) fiised to the CSup35 domain. These constructs 

20 were introduced into the genome of [psi-] yeast strain 74-D694 with the wild-type Sup35 
promoter, in each case replacing the native Sup35 gene. Transform ants were selected on 
uracil-deficient medium and confirmed by genomic PCR. Recombinant excision events 
were selected on medium containing 5-fluoroorotic acid. [See Ausubel et aL, Current 
Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, 

25 New York (1991).] Strains in which wild-type Sup35 was replaced with the R2E2- 

CSup35 and RA2-5CSup35 variants were screened by PCR and confirmed by Western 
blotting. The cells were cultured on ypd or synthetic complete media at 25°C for 24 
hours, serially diluted, and plated on SD-ADE media to screen for [PSt] conversions. As 
shown in Fi g ure 4. the spontaneouS-rate-of-appearance-of-[^iS7^]-CQlQnies-was-increased — 

30 about 5000-fold in cells carrying the repeat expansion (R2E2) compared to wild-type 
cells. The wild-type cells produced colonies on the selective medium at a frequency of 
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about 1 per million cells plated. The RA2-5 cells produced such colonies at even lower 
frequency, and it appears that none of these were attributable to development of a [PSr] 
phenotype, since they could not be cured by growth on medium containing 5 mM 
guanidine HCl. In contrast, growth of the wild-type and the R2E2 colonies on the 
5 selective medium could indeed be cured by the guanidine HCl treatment. 

In additional experiments, the effects of the Sup35 repeat variants were 
examined when they were used to replace the wild-type Sup35 gene in [PSI^] cells. Cells 
with the R2E2 replacement remained [PSr], whereas all cells carrying the RA2-5 
replacement became [psi-]. Thus, maintenance of the [PSr] phenotype requires a Sup35 

1 0 gene having more than one of the oligopeptide repeats. 

Still another series of tests examined the effects of the repeat variants on 
the structural transition of NM in vitro. When purified recombinant NM is denatured and 
diluted into aqueous buffers, it slowly changes from a random coil into a P-sheet rich 
structure and forms fibers that bind Congo red with the spectral shift characteristic of 

15 amyloid proteins. When deposited at high concentrations, the Congo red-stained fibers 

also show apple-green birefringence. To determine if the repeat variants alter the intrinsic 
capacity of the protein to fold in this form, the wild-type and two repeat variants were 
purified in fully denatured states and then diluted into a non-denaturing buffer. Structural 
changes were monitored by the binding of Congo red [Klunk et al, J. Histochem. 

20 Cytochem,, 37: 1293-1297 (1989)]and confirmed by circular dichroism and electron 

microscopy analysis. In these experiments, the R2E2 variant converted to a P-sheet rich 
structure about twice as quickly as the wild-type NM polypeptide, which in tum 
converted significantly faster than the RA2-5 variant. These differences were 
reproducibly obtained in both rotated and unrotated reactions, although the transition was 

25 slower in the unrotated reactions. This data indicates that alterations in the number of 

repeat units alters the propensity of Sup35 NM polypeptides to progress fi*om an unfolded 
state into a p-sheet rich, higher-ordered structure. 

The foregoing experiments demonstrate that the propensity of novel 

chimeric proteins to a ggregate_into_prion jike-fibrils can-be-modulated-bv-alteration of-the- 

30 SCHAG amino acid sequence of the chimera. Modulation of any SCHAG amino acid 
sequence in this manner is specifically contemplated as an aspect of the invention, as are 
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the resulting gene and protein products. In addition to alteration by adding or deleting 
oligopeptide repeat regions, alterations by adding or deleting larger regions is specifically 
contemplated as an aspect of the invention. By way of example, the entire N terminal 
region of Sup35 or Ure2 could be duplicated to increase the propensity of transformed 
5 cells to produce aggregated chimeric sequences. 

Example 4 

Demonstration that a prion can be moved from one organism to another 

The following experiments demonstrate that a prion protein from one 
organism will continue to behave in a prion-hke manner when recombinantly expressed 
10 in another organism, and can even do so when expressed in a different cellular 
compartment than that in which the protein is produced in its native host. 

Polynucleotides encoding mouse (SEQ ID Nos: 18 and 19) and Syrian 
Hamster (SEQ ED Nos: 20 and 21) PrP proteins were expressed in yeast cells under the 
control of the constitutive GPD promoter. The protein was produced in the yeast cytosol, 
1 5 without signal sequences that would normally guide it to the endoplasmic reticulum, and 
without the tail that is normally clipped off during maturation of these proteins in their 
native hosts, hi other words, the PrP protein product in yeast was similar to the final 
mature product in mammalian neurons, except that it did not contain the sugar 
modification and GPI anchor. There has been considerable data suggesting that these 
20 sugar and GPI anchor characteristics are not required for prion formation. 

The normal cellular form of PrP (PrP*^)is detergent soluble, but the 
conformational ly changed-protein that is characteristic of neurodegenerative prion disease 
states (PrP^'^) is insoluble in detergent such as 10% Triton. When PrP protein is expressed 
in yeast, is was insoluble in non-ionic detergents, suggesting that a PrP^*" form was 
25 present. 

PrP-transfected yeast cells were lysed in the presence of 10% Sarkosyl and 
centrifuged at 16,000 x g over a 5% sucrose cushion for 30 minutes. Proteins in both the 
supernatant and pellet fractions were an alyzed on SDS p olyacrylamide-fiels.-Coomassie — 
blue staining revealed that most proteins were soluble under these conditions and were 
30 present in the supernatant fraction. When identical gels were blotted to membranes and 



-59- 

reacted with antibodies against mammalian PrP, most of the PrP protein was found in the 
pellet fraction, further suggesting that a PrP^*" form was present in the yeast. 

Protease studies provide further evidence that the yeast PrP was adopting a 
PrP^*" conformation. When PrP protein is expressed in yeast it displays the same highly 
5 specific pattern of protease digestion as does the disease form of the protein in mammals. 
The normal cellular form of PrP is very sensitive to protease digestion. In the disease 
form, the protein is resistant to protease digestion. This resistance is not observed across 
the entire protein, but rather, the N-terminal region from amino acids 23 to 90 is digested, 
while the remainder of the protein is resistant. As expected, when PrP was expressed in 
10 the yeast cytosol it was not glycosylated, and it migrated on an SDS gel as a protein of 
-27 kD. After protease digestion, a resistant fragment of --19-20 kD was detected, 
corresponding exactly to the size expected if the protein were being cleaved at the same 
site as the PrP^*" form of the protein that can be recovered from diseased mammalian 
brains. 

15 The foregoing data indicates that, when mammalian PrP is expressed in 

yeast, a species from an entirely different taxonomic kingdom, it be behaves unlike 
common yeast proteins, and very much like the disease form of PrP in mammals. 

Besides the diseased form, a small portion of PrP protein expressed in 
yeast cytosol also behaves like the normal cellular form of PrP. Even after centrifugation 

20 at 180,000g for 90 minutes, there is still some PrP protein detectable in the supernatant 
fraction. This part of PrP expressed in yeast, like normal cellular PrP, was soluble in 
non-ionic detergent, suggesting this small portion of PrP is present in the PrP"" 
conformation. 

Example 5 

25 Assays to identify novel prion-like amyloidogenic sequences 

The following experiments demonstrate how to identify novel pri on-like 
amyloidogenic sequences and confirm their ability to form prions in vivo. The 

experiments-involve-(A)-identifying-sequenGes-suspeeted of-having-prion-for^^^ 

capability; and (B) screening the sequences to confirm prion forming ability. 
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A. Identifying sequences suspected of having prion forming capability 
Known prion or prion-like amino acid sequences, or polynucleotides 
encoding such sequences, are used to probe sequence databases or genomic libraries for 
similar sequences. For example, in one embodiment, a prion or prion-like amino acid 
5 sequence (e.g., a mammalian PrP sequence; the N or NM regions from a yeast Sup35 
sequence; or the N region from a yeast Ure2 sequence) is used to screen a protein 
database (e.g., Genbank or NCBI) using a standard search algorithm (e.g., BLAST 
1 .4.9.MP or more recent releases such as BLAST 2.0, and a default search matrix such as 
BLOSUM62 having a Gap existence cost of 1 1, a per-residue gap cost of 1, and a Lambda 

10 ratio of 0.85. See generally Altschul et aL, Nucleic Acids Res., 25(17): 3389-3402 

(1997).). As an exemplary cutoff, database hits are selected having P(N) less than 4 x 10' 
^, where P(N) represents the smallest sum probability of an accidental similarity. For 
database searching, polypeptide sequences are preferred, but it will be apparent that 
polynucleotides encoding the amino acid sequences also could be used to probe 

1 5 nucleotide sequence databases. 

In an alternative embodiment, one or more polynucleotides encoding a 
prion or prion-like sequence is amplified and labeled and used as a hybridization probe to 
probe a polynucleotide library (e.g., a genomic library, or more preferably a cDNA 
library) or a Northern blot of purified RNA for sequences having sufficient similarity to 

20 hybridize to the probe. The hybridizing sequences are cloned and sequenced to determine 
if they encode a candidate amino acid sequence. Hybridization at temperatures below^ the 
melting point (T^) of the probe/conjugate complex w^ill allow pairing to non-identical, but 
highly homologous sequences. For example, a hybridization at 60°C of a probe that has a 
T^ of yO'^C will permit -10% mismatch. Washing at room temperature will allow the 

25 annealed probes to remain bound to target DNA sequences. Hybridization at 

temperatures (e.g., just below the predicted T^ of the probe/conjugate complex) will 
prevent mismatched DNA targets from being bound by the DNA probe. Washes at high 
temperature will further prevent imperfect probe/sequence binding. Exemplary 
hybridization-eonditions-are as-follows:-hybridization-overnight-at-50°e-in-A^ 

30 [5X SSC (where IX SSC is 150 mM NaCl, 15 mM sodium citrate, pH 7), 5X Denhardt's 
solution, 1% sodium dodecyl sulfate (SDS), 100 |ig/ml single stranded DNA (salmon 
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sperm DNA)] with 10 ng/ml probe, and washing twice at room temperature for ten 
minutes with a wash solution comprising 2X SSC and 0. 1% SDS. Exemplary stringent 
hybridization conditions, useful for identifying interspecies prion counterpart sequences 
and intraspecies allelic variants, are as follows: hybridization overnight at 68°C in APH 
5 solution with 10 ng/ml probe; washing once at room temperature for ten minutes in a 
wash solution comprising 2X SSC and 0.1% SDS; and washing twice for 15 minutes at 
68°C with a wash solution comprising O.IX SSC and 0.1% SDS. 

In another alternative embodiment, known prion sequences or other 
SCHAG amino acid sequences are modified, e.g., by addition, deletion, or substitution of 
10 individual amino acids; or by repeating or deleting motifs known or suspected of 

influencing fibril-forming propensity. To form novel prion sequences, modifications to 
increase the number of polar residues (glutamine, asparagine, sorine, tyrosine) are 
specifically contemplated, with modifications that increase glutamine and asparagine 
content being highly preferred. [See Depace et al. Cell 95:1241-1252 (1998), 
1 5 incorporated herein by reference.] In a preferred embodiment, the alterations are effected 
by site directed mutagenesis or de novo synthesis of encoding polynucleotides, followed 
by expression of the encoding polynucleotides. 

In yet another alternative embodiment, antibodies are generated against the 
prion forming domain of a prion or pri on-like protein, using standard techniques. See, 
20 e.g.^ Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor 

Laboratory, Cold Spring Harbor, NY (1988). The antibodies are used to probe a Western 
blot of proteins for interspecies counterparts of the protein, or other proteins that possess 
highly conserved prion epitopes. Candidate proteins are purified and partially sequenced. 
The amino acid sequence information is used to generate probes for obtaining an 
25 encoding DNA or cDNA from a genomic or cDNA library using standard techniques. 

Sequences identified by the foregoing techniques can be further evaluated 
for certain features that appear to be conserved in prion-like proteins, such as a region of 
50 to 150 amino acids near the protein's amino-terminus or carboxyl-terminus that is rich 

in_glycine,.glutamine,-and-asparagine, and-possibly-the-polar-residues-serine and tyro^ 

30 which region may contain several oligopeptide repeats and have a predicted high degree 
of flexibility (based on primary structure). In the case of Sup35, a highly charged domain 
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separates the flexible N-temiinal region having these properties from the functional C- 
terminal domain. Sequences possessing one or more of these features are ranked as 
preferred prion candidates for screening according to techniques described in the 
following section. 

5 By way of example, the Genbank protein database (accessible via the 

worldwide web at www.ncbi.nlm.nih.gov) was screened using the Basic Local Alignment 
Search Tool (BLAST) program (version 1 .4.9) using the standard (default) matrix and 
stringency parameters (BLOSUM62). The prion forming domains of Ure2 (Genbank 
Acc. No. M35268, SEQ ID NO: 4, amino acids 1-65) and Sup35 (Genbank Acc. No. 
10 M21 129, SEQ ED NO: 2, amino acids 1-1 14) from S. cerevisiae were used as BLAST 
query sequences. Open reading frames (ORPs) from S. cerevisiae with high similarity 
scores [P(N) less than 4 x 1 0"^] resulting from the initial search included the following 
Genbank database entries: 

M (1) residues 53-97 from Accession No. Z73582 (SEQ ID NO: 22), an 

I J 15 uncharacterized open reading from S,cerevisiae; 

^ (2) residues 1030-1071 from PID No. e236901, in Accession No. Z71255 

□ (SEQ ID NO: 23), an uncharacterized open reading fi-om S,cerevisiae; 

p (3) residues 4-58 from locus ybm6, Accession No. P38216 (SEQ ID NO: 24), 

an uncharacterized open reading from S. cerevisiae; 
Q 20 (4) residues 251-380 fi-om locus hrpl. Accession No. U35737 (SEQ ID NO: 

25) , an RNA binding and transport protein having homology to hnRNPl in 
humans. 

(5) residues 28-126 from locus npl3. Accession No. U33077 (SEQ ID NO: 

26) , an RNA binding and transport protein that functions genetically in the 
25 same pathway as Hrp 1 ; 

(6) residues 97-286 from locus mcml, Accession No. X14187 (SEQ ID NO: 

27) , a DNA binding protein active in cell cycle regulation and mating-type 
specificity; 

(-7-) residues-205-414-from-locus-nsrl— Accession-No-P274-76-(SEQ-ID-NO: 

30 28), a protein that binds nuclear localization sequences and is active in 

mRNA processing; 
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(8) residues 153-405 from Accession No. P25367 (SEQ ED NO: 29), an 
uncharacterized open reading frame; 

(9) residues 806-906 from Accession No. P40467 (SEQ ED NO: 30), an 
uncharacterized open reading frame; 

(10) residues 605-677 from Accession No. S54522 (SEQ ID NO: 3 1), an 
uncharacterized open reading frame; 

(11) residues 100-300 from locus yk76. Accession No. P36168 (SEQ ID NO: 

32) , an uncharacterized open reading frame; 

(12) residues 1 to 250 from locus fpsl, Accession No. S16712 (SEQ ID NO: 

33) , a membrane channel protein that controls passive efflux of glycerol; 

(13) residues 334-388 from Accession No. p40002 (SEQ ED NO: 34), an 
uncharacterized open reading frame; 

(14) residues 325-375 from locus madl. Accession No. P40957 (SEQ ED NO: 

35) , an uncharacterized open reading frame; and 

(15) residues 215-284 from locus karl. Accession No. M15683 (SEQ ID NO: 

36) , an uncharacterized open reading frame. 

The nuclear polyadenylated RNA-binding protein hrpl (Genbank 
Accession No. U35737) is an especially promising prion candidate. It is the clear yeast 
homologue of a nematode protein previously cloned by cross-hybridization with the 
human PrP gene; it scored highly (p value 3.9 e-5) in a Genbank BLAST search for 
sequences having homology to the N-terminal domain of Sup35; and it contains a stretch 
of 130 amino acids at its C-terminus that is glyine- and asparagine-rich and contains 
repeat sequences similar to the oligomeric repeats in the N-terminal domain of Sup35; 
and is predicted by secondary structure programs to consist entirely of turns. 

The sequence corresponding to residues 153-405 of SEQ ID NO: 29 
comprises another promising prion candidate. This region is rich in glutamine and 
asparagine, and is part of a protein that is normally found in aggregates in yeast although 
it is not aggregated in some strains. When expressed as a fusion protein with green 

fluoresc-ent_proteiii,_this-sequence causes-the-GFP-to-aggTegate^^his-a^ 

completely dependent upon Hspl04, much the same as Sup35 aggregation. When 
residues 153-405 of SEQ ID NO: 29 are substituted for the NM region of SUP35 and 
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transformed into [psi-] yeast, the yeast exhibit a suppression phenotype analogous to 

[psri 

B. Screening sequences to confirm prion-forming capability . 

Sequences identified according to methods set forth in Section A are 
screened to determine if the sequences represent/encode proteins having the ability to 
aggregate in a prion-hke manner. 

1. Aggregation assay using fusion proteins 

In a preferred screening technique, a polynucleotide encoding the ORF of 
interest is amplified fi*om DNA or RNA fi-om a host cell using polymerase chain reaction, 
or is synthesized using the well-known universal genetic code and using an automated 
synthesizer, or is isolated fi-om the host cell of origin. The polynucleotide is ligated in- 
fi-ame with a polynucleotide encoding a marker sequence, such as green fluorescent 
protein or firefly luciferase, to create a chimeric gene. In a preferred embodiment, the 
polynucleotide is ligated in fi-ame with a polynucleotide encoding a fusion protein such as 
a Bleomycin/luciferase fusion, which would permit both selection for drug-resistance and 
quantification of soluble and insoluble proteins by enzymatic assay. See, e,g., Elgersma 
etaL, Genetics, 135: 731-740(1993). 

The chimeric gene is then inserted into an expression vector, preferably a 
high-copy vector and/or a vector with a constitutive or inducible promoter to permit high 
expression of the ORF-marker fusion protein in a suitable host, e.g., yeast. The 
expression construct is transformed or transfected into the host, and transformants are 
grown under conditions that promote expression of the fusion protein. Depending on the 
marker, the cells may be analyzed for marker protein activity, wherein absence of marker 
protein activity despite the presence of the marker protein is correlated with a likelihood 
that the ORF has aggregated, causing loss of the marker activity. Alternatively, host cells 
or host cell lysates are analyzed to determine if the fusion protein in some or all of the 
cells has aggregated into aggregates such as fibril-like structures characteristic of prions. 
-The-analysis-is Gonducted using-one-or-more-standard-teehniquesrincluding-microscopic — 
examination for fibril-like structures or for coalescence of marker protein activity; 
analysis for sensitivity or resistance to protease K; spectropolarimetric analysis for 
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circular dichroism that is characteristic of amyloid proteins; and/or Congo Red dye 
binding. 

A number of the candidates identified above were screened in this manner 
using a GFP fusion construct. To create the vector that was employed in these analyses, a 
copper inducible Cupl promoter was ampHfied from a genomic library by standard 
polymerase chain reaction (PGR) methods using the primers 5'- 
GG GAATTC CC ATT ACCGAC ATTTGGGCGC-3 ' (SEQ ID NO: 37) and 5'- 
GGGGATCCTGATTGATTGATTGATTGTAC-3' (SEQ ID NO: 38), digested with the 
restriction enzymes EcoRI and BamHI, and Ugated into the pRS316 vector that had 
digested with EcoRI and BamHI. The annealed vector, designated pRS316Cupl, was 
transformed into E, Coli strain AG-1, and transformants were selected using the 
ampicillin resistance marker of the vector. Correctly transformed bacteria were grown 
overnight to provide DNA for further vector construction. 

Next, a sequence encoding superbright GFP (SEQ ID NOs: 39, 40) was 
inserted into the pRS316Cupl vector. Superbright GFP was amplified from pPSGFP 
using the primers 5'-GACCGCGGATGGCTAGCAAAGGAGAAG-3' (SEQ ID NO: 41) 
and 5'-CCTGAGCTCTCATTTGTATAGTTCATCC-3' (SEQ ID NO: 42). The resultant 
PCR products were digested with Sad and SacII and inserted into PRS316Cupl that also 
had been digested ed with Sad and SacII. This created a pRS316CuplGFP plasmid into 
which a polynucleotide encoding a candidate open reading frame could be inserted for 
expression studies. In particular, it was contemplated that candidate open reading frames 
be amplified by PCR from genomic DNA or cDNA using primers engineered to contain 
BamHI and SacII restriction sites, to permit rapid cloning into the BamHI and SacII sites 
of the deri ved PRS3l6CupI GFP vector. For example, in the case of open reading frame 
(ORF) P25367 the following primers were used: 5'- 

GGA GGATCC ATGGATACGGATAAGTTAATCTCAG-3^ (SEQ ID NO: 43, BamHI 
site underiined) and 5 '-GGA CCGCGG GTAGCGGTTCTGTTG AGAAAAGTTGCC-3 ' 
(SEQ ID NO: 44, SacII site underlined). PCR products were digested with BamHI and 

— SaeII-and-inserted4nto-the-derived-plasmid— This-created a plasmid-that-can-inducibly 

express a fusion of an open reading frame of interest fused to GFP. The sequence of 
pRS316-Cupl-p25367-GFP is set forth in SEQ ID NO: 45. 



-66- 

2. In vitro aggregation assay using chaperone protein 

A polynucleotide encoding the ORF of interest is synthesized using the 
well-known universal genetic code and using an automated synthesizer, or is isolated 
from the host cell of origin, or is amplified using polymerase chain reaction from DNA or 
5 RNA from such a host cell, fri a preferred embodiment, the polynucleotide frirther 
includes a sequence encoding a tag sequence, such as a polyhistidine tag, HA tag, or 
FLAG tag, to facilitate purification of the recombinant protein. The polynucleotide is 
inserted into an expression vector and expressed in a host cell compatible with the 
selected vector, and the resultant recombinant protein is purified. 
10 Serial dilutions of the recombinant polypeptide {e.g,, 100 mM, 10 mM, 1 

mM, 0.1 mM, 0.01 mM final concentration) are mixed with 1 |i,g of a chaperone protein 
such as yeast Hspl04 protein [See Schirmer and Lindquist, Meth. Enzymol,, 290: 430-444 
(1998)] in a low sah buffer (e.g., 10 mM MES, pH 6.5, 10 mM MgS04) containing 5 mM 
ATP in a 25 |ll reaction volume. As controls, reactions are performed in parallel using 
15 buffer alone or using Sup35 protein. Reactions are incubated at 37°C for eight minutes, 
and the ATPase activity of the chaperone protein is measured by determining released 
phosphate, e.g., using Malachite Green [Lanzetta et al.Analyt. Biochem., J 00: 95-97 
(1979)]. In this assay, several fibril-aggregation proteins, including yeast Sup35, the yeast 
Sup35 N terminal domain, mammalian PrP protein, and P-amyloid (1-40) and (1-42) 
20 forms, were found to inhibit the ATPase activity of Hspl04; whereas control proteins 
(aldolase, BSA, apoferritin, and IgM) did not. 
3. Assav results 

To determine if the proteins represented by the ORF's identified above in 
part A were aggregation prone, a hallmark of prions, polynucleotides encoding the 
25 specified residues of interest within the ORF's were amplified from S. cerevisiae genomic 
DNA via PGR and ligated in-frame to a sequence encoding superbright, as described 
above in section B. 1 . 

These plasmids were transformed into the yeast strain 74D (a, his, met, 

leu,-ura,-ade) —Tran s form an t-coloni es-were-se 1 eet ed-(ura+)-and-i no cu 1 at ed i n t o l i q ui d S D 

30 ura and grown to early log phase. Copper sulfate was added to the cultures (final 
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concentration 50|aM copper) to induce protein expression. Cells were fixed after four 
hours of induction and intracellular GFP expression was visualized. 

Examination of GFP fluorescence revealed that the sGFP tag had 
coalesced in transformants expressing six of the ORF's. This coalescence was similar to 
5 that observed with Sup35-GFP fusions in [PSJ^] yeast and was considered to be indicative 
of an ORP having prion-like aggregate-forming ability. Two of the positive sequences 
represent uncharacterized open reading frames: Z73582 and ybm6. Four are known 
proteins: mcml, fpsl, p25367 and hrpl as described above in section B.l . Aggregation 
of the MCMl-GFP fusion was relatively rare, and was not influenced by Hspl04 dosage 

10 in the cells. Of particular interest was the hrpl construct, which aggregated into multiple 
cytoplasmic points in the transformed S. cerevisiae, and also in transformed C. elegans. 
Deletion of the Hspl04 gene was shown to eliminate the aggregation pattem of hrpl. 
Also of special interest was the aggregation pattem of the P25367 construct, because this 
aggregation was completely eliminated by overexpression of Hspl04. 

1 5 The foregoing experiments demonstrate that searches with prion forming 

sequences will identify additional sequences with prion-like properties, which sequences 
can be used according to various aspects of the invention that are specifically exemplified 
herein with respect to Sup35 or IJRE2 sequences. 

The abihty of newly identified aggregating proteins to exist in both an 

20 aggregating and non-aggregating conformational state can be further examined, if desired, 
by studying aggregation phenomena in host cells expressing varying levels of the protein 
(a result achieved using an inducible promoter, for example), and in host cells having 
normal and over- or under-expressed chaperone protein levels. (The ability of Sup35 in 
yeast to enter a [PST] conformation depends on an appropriate intermediate level of the 

25 chaperone protein Hspl04; elimination of Hspl04 or over-expression of Hspl04 causes 
loss of [PSr] and prevents de novo appearance of {PSt^, See Chemoff et aL, Science, 
268\ 880 (1995) and Patino et aL, Science, 273: 622-626 (1996). Growth on a mildly 
denaturing media, as described elsewhere herein, provides another alternative assay. 
The-foregoing-assaysrchimericconstructsrand-candidate-SeHAG-amino — 

30 acid sequences are all intended as aspects of the invention. 
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Example 6 

Identification of Rnql as an epigenetic modifier of protein function in yeast 

The following experiments demonstrate that putative prions can be 
identified by searching for three attributes of the known yeast prion proteins: unusual 
5 amino-acid composition with a high concentration of the polar amino-acid residues 
glutamine and asparagine, constant expression levels through log and stationary phase 
growth, and a capacity to switch between distinct stable physical states (in this case, 
insoluble and soluble forms). One of the candidates isolated in this search, Rnql, has both 
in vitro and in vivo characteristics of a prion. Rnql, exists in distinct, heritable physical 

10 states, soluble and insoluble. The insoluble state is dominant and transmitted between 
cells through the cytoplasm. When the prion-like region of Rnql was substituted for the 
prion domain of Sup35, the protein determinant of the prion [PSI^], the phenotypic and 
epigenetic behavior of [PSI^^ was fully recapitulated. These findings identify Rnql as a 
prion, demonstrate that prion domains are modular and transferable, and establish a 

15 paradigm for identifying and characterizing novel prions. 



A. Identification of prion candidates 

The characteristics of Sup35 and Ure2 suggested several criteria for 
identifying new prion candidates. Previous experiments have demonstrated that particular 
regions (residues 1-65 for Ure2 (Genbank Acc. No. M35268, SEQ ID NO: 4) and 
20 residues 1-123 for Sup35 (Genbank Acc. No. M21 129, SEQ DI NO: 2)) are critical for 
prion formation by these proteins. Over-expression of these regions is sufficient to 
induce the prion phenotype de novo. Deletion of these regions has no effect upon the 
normal cellular function of the proteins but prevents them fi-om entering the prion state. 
These critical prion-determining domains have an unusually high concentration of the 
25 polar residues glutamine and asparagine and are predicted to have very little secondary 

structure. The domains are located at the ends of proteins that have an otherwise ordinary 
amino acid composition. We hypothesized that by searching for open reading frames 

^with-these eharaeteristics-we-might-find new-prion proteins: 

A BLAST search (1.4.9MP version) of the NCBl database of non- 
30 redundant coding sequences was performed using the prion-determining domains of Ure2 
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and Sup35 (residues 1-65 of SEQ ID NO: 4 and residues 1-123 of SEQ ED NO: 2, 
respectively) as the query sequence with the following parameters: V=100, B=50, H=0, 
S=90, and P=4. This search revealed approximately twenty open reading frames that had 
prion-like domains appended to polypeptides with an otherwise normal amino acid 
composition. To restrict the number of likely candidates, we took advantage of recent 
global descriptions of mRNA expression patterns. In examining this data we noted that 
Sup35 and Ure2 are expressed at nearly constant levels as cells transit from the log to the 
stationary phase of growth. Large fluctuations in expression would be inconsistent with 
the stability of both their heritable prion and non-prion states. The open reading frames 
from the BLAST search whose expression varies by less than two-fold in the log phase 
transition were selected for further analysis. They were fused to the coding sequence of 
green fluorescent protein (GFP) using PGR and expressed in the yeast strain 74D-694 
(adel-14, trpl'289, his3A-200, ura3-52, leu2-3, lys2). Three of the proteins, RNQl 
(Genbank Acc. No. NP009902, SEQ ID NO: 50), YBR016w (Genbank Acc. No. 
NP009572, SEQ ID NO: 51), and HRPl (Genbank Acc. No. NP014518, SEQ ID NO: 
52), showed coalescence of GFP, as previously described for Sup35. 

B. Rnql exists in distinct states controllable by Hspl04 

We next asked if expression of the fusion protein in a strain that lacked the 
chaperone Hspl04 eliminated the coalescence of GFP, as it does for Sup35-GFP fusions. 
This is not a necessary criterion for prion proteins (an interaction with Hspl04 has not 
been demonstrated for [URE3]) but interaction with the chaperone provides a useful tool 
for further analysis. In wild-type yeast, fluorescence from the Rnql -GFP fusion was 
found in one or more small, intense, cytoplasmic foci. When the fusion protein was 
expressed in the isogenic Dhspl04 strain, fluorescence was diffuse. The C-terminal end 
of Rnql (amino acids 153-405 of SEQ ID NO: 50) contained the region rich in glutamine 
and asparagine residues. Fusion of this region alone to GFP gave an identical result to 
that seen with the full length Rnql -GFP fusion. Since the effect of HSP104 deletion upon 
the-Goalescenee-of^the-Rnq l-fusion was the most dramaticrit'was chosen for furt^ 
analysis. 
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Differential centrifugation was employed to detennine if the coalescence 
observed with Rnql-GFP fusion proteins reflected the behavior of the endogenous Rnql 
protein. Log phase yeast were lysed using a bead beater (Biospec) into 75niM Tris-Cl 
(pH7), 200mM NaCl, 0.5 mM EDTA, 2.5% glycerol, 0.25mM EDTA, 0.25% Na- 
5 deoxycholate, supplemented with protease inhibitors (Boehringer-Mannheim). Lysates 
were cleared of crude cellular debris by a 15 second 6000 RPM spin in a microcentrifuge 
(Eppendorf). Non-denatured total cellular lysates were fractionated by high-speed 
centrifugation into supernatant and pellet fractions using a TLA- 100 rotor on an Optima 
TL ultracentrifuge (Beckman) at 280,000 x g (85,000 RPM) for 30 minutes. Protein 

10 fractions were resolved by 10% SDS-PAGE and immunoblotted with an a-Rnql 

antibody. Rnql remained in the supernatant of a Ahspl04 strain, but pelleted in the wild- 
type. Thus, the GFP coalescence is not an artifact of the fusion; the Rnql protein itself is 
sequestered into an insoluble aggregate in an Hsp 1 04-dependent fashion. We also 
examined the solubility of Rnql in several unrelated yeast strains. In four (S288c, 

15 YJM436, SKI and W303) the protein fractionated in the pellet, in two (YJM128, 

YJM309) it partitioned between the pellet and supernatant fractions, and in two others 
(33G, 10B-H49) the protein was chiefly recovered in the supernatant fraction. Thus, 
Rnql naturally exists in distinct physical states in different strains. 

C. The insoluble state of Rnql is transmitted by cytoduction 

20 The heritability of the known yeast prions is based upon the ability of 

protein in the prion state to influence other protein of the same sequence to adopt the 
same state. Because the protein is passed from cell to cell through the cytoplasm, the 
conformational conversion is heritable, dominant in crosses, and segregates in a non- 
Mendelian manner. To determine if the insoluble state of Rnql is transmissible in this 

25 way, we used cytoduction, a well-established tool for the analysis of the [PST^^ and 
[URE3] prion. The karyogamy deficient (karl-1) strain 10B-H49 {p°ade2-l, lysl-l, 
his3-n J 5, Ieu2-3J 1 2, karl-1, ura3::KANR) can undergo normal conjugation between a 

andjajcjs|]s_butJS-unable-to-fuse-its-nuGleus-with-i^^^ 

and organelles are mixed in fused cells, but the haploid progeny that bud from them 

30 contain nuclear information from only one of the two parents. 
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10B-H49 shows diffuse expression of Rnql-GFP, and served as the 
recipient for the transfer of insoluble Rnql from W303 (Mata, his3-l Ieu2-3J 12, 
trpl-1, ura3-], ade2-l), the donor. After cytoduction, colonies derived from haploid cells 
that contained the 10B-H49 nuclear genome but had undergone cytoplasmic mixing, as 
5 demonstrated by mitochondrial transfer, were selected. Cytoductants were selected after 
overnight mating on defined media lacking tryptophan that had glycerol as the sole carbon 
source. All showed single or multiple cytoplasmic aggregates of Rnql-GFP - a pattern 
indistinguishable from that of the W303 parent. Furthermore, density-based 
centrifugation of protein extracts, performed as above, indicated that cytoduction caused 

10 the endogenous Rnql protein of the 10B-H49 strain to shift from the soluble to the 
insoluble fraction. Thus exposure of 10B-H49 cells to the cytoplasm of W303 is 
sufficient to cause a heritable change in the physical state of Rnql. Because RNQl is a 
nuclear gene (not transmitted during cytoduction) the protein's insoluble state is not due 
to polymorphisms in its amino acid sequence, nor to any other trait carried by the W303 

15 genome. Rather, like the Sup35 and Ure2 prions, its altered conformational state is 
"infectious", transmissible from one protein to another. 

D. Purified Rnql forms fibers and shows seeded polymerization 

Both Sup35 and Ure2 have the capacity to form highly ordered amyloid 

20 fibers in vitro, as analyzed by the binding of amyloid specific dyes and by electron 

microscopy. To examine conformational transitions of Rnql in vitro, the protein was 
expressed in E. coli and studied as a purified protein. Rnql was cloned into pPROEX- 
HTb (GibcoBRL). The primers 5'-GGA GGA TCC ATG GAT ACG GAT AAG TTA 
ATC TCAG-3 ' (SEQ ID NO: 53) and 5'-CC AAG CTT TCA GTA GCG GTT CTG TTG 

25 AGA AAA GTTG-3' (SEQ ID NO: 54) were used for PGR in a solution containing 1 0 
mM Tris (pH8.3), 50 mMKCl, 2.5 mM MgCl2,2 mM dNTPs, 1 |J,M of each primer and 2 
U of Taq polymerase; and using genomic 74D DNA as template under the following 
conditions: incubation at 94 °C for 2 min, followed by 29 cycles of 94 °C for 30 sec, 50'^C 

foLl0_sec,-and-7-2-^C-for-90-seG5-followed-by-a-final-incubation-at-72-°C-for-10 minutes: 

30 The PGR product was then digested and ligated into the BamHl and Hindlll sites of 

pPROEX-HTb (GibcoBRL). The plasmid was electroporated into BL21-DE3 laclq cells. 
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Transformed bacterial cultures were induced at OD^oo = 1 with 1 mM IPTG for four hours 
at 30°C, The cells were lysed in 8M urea (Rnql was purified under denaturing conditions 
(8M urea) because it had a tendency to form gels during purification in the absence of 
denaturant), 20mM Tris-Cl pH8. Protein was purified over a Ni-NTA column (Qiagen) 
5 followed by Q-sepharose (Pharmacia). The (His)6-tag from the vector was cleaved under 
native conditions (150mM NaCl, 5 mM KPi) using TEV protease followed by passage of 
the protease product over a Ni-NTA column to remove uncleaved protein. Protein was 
methanol precipitated prior to use. Recombinant protein was resuspended in 4M urea, 
150mM NaCl, 5 mM KPi, pH 7.4 at a concentration of 10 [iM. Seeded samples were 

10 created by sonication of 1/50 volume of a 10|iM solution of pre- formed fibers verified by 
electron microscopy. The protein samples were incubated at room temperature on a 
wheel rotating at 60 r.p.m. 

To determine if Rnql forms amyloids we used Thioflavin T fluorescence. 
This dye exhibits an increase in fluorescence and a red-shift in the of emission upon 

15 binding to multimeric fibrillar P-sheet structures characteristic of many amyloids, 

including transthyretin, insulin, p-2 microglobulin and Sup35. Fluorimeter samples 
were prepared as 3.3|aM Rnql, 50|aM Thioflavin T in buffer. Samples were analyzed on 
a Jasco FP750 with the following settings: X^^^^ 409nm, 484nm, bandwidth lOnm. 
The acquisition of Thioflavin T binding was sigmoidal (lag phase six) suggesting a self- 

20 seeded process of protein assembly. The addition of 2% preformed fibers to fresh 
solutions of Rnql reduced the lag time - from 6.4±0.2 hrs to 4.3±0.2 hrs (n=4). 

The formation of higher ordered structures was confirmed by transmission 
electron microscopy. For electron microscopy analysis, 5|ll of a lOflM protein solution 
was placed on a 400 mesh carbon coated EM grid (Ted Pella, Cat. 01822), and allowed to 

25 adsorb for 1 minute. The sample was negatively stained with 200|al of 2% aqueous 

uranyl acetate, and wicked dry. Samples were observed in a Philips CM120 transmission 
electron microscope operating at 120kV in low dose mode. Micrographs were recorded at 
a magnification of 45,000 on Kodak SO- 163 film. The protein formed fibers with a 
diarneter_ofJT.3„±_L.4iUTi. Jhis-figureis-comparable-to-the-repo 

30 nm) and Sup35 (-17 nm) fibers. The fibers appeared to be branching and the termini were 
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unremarkable. The appearance of the fibers was coincident with the onset of rapid 
increases in Thioflavin T fluorescence. 



E. Rnql Disruption 

[URE3] and [PST^] produce phenotypes that mimic loss-of-function mutations in 
5 their protein determinants. To determine the loss of function phenotype of Rnql, the 

entire ORF was deleted by homologous recombination in a diploid 74D-694 strain using a 
kanamycin resistance gene. Strains deleted of the Rnql open reading frame were created 
using the long flanking homology PGR method. Primers 5'-GGT GTC TTG GCC AAT 
TGC CC-3' (SEQ ID NO: 55) and 5'-GTC GAG GTG GAG GGT AGG GAT TTG AGA 

10 TGT TTG GTA TAG-3' (SEQ ID NO: 56) or 5'-CGA GGT CGA ATT GAT CGA TTG 
ATT GAG TTG GGG TTG TATG-3' (SEQ ID NO: 57) and 5'-GTG TTT TGA AAG 
GGT GCA GATG-3' (SEQ ID NO: 58) were used to amplify genomic DNA. These PGR 
products were used as primers for a second round of PGR on plasmid pFA6a, which is 
described in Wach et al., Yeast 13:1065-75 (1994), digested with Notl. The product of 

15 the second PGR round was used to transfomi log-phase yeast cultures. Transformants 

were selected on YPD containing 200 mg/mL G418 (GibcoBRL), Upon sporulation each 
tetrad produced four viable colonies, two of which contained the Rnql disruption, 
confirmed by immunob lotting total cellular proteins with an a-Rnql antibody and PGR 
analysis of the genomic region. The isrnql strain had a growth rate comparable to that of 

20 wild-type cells on a variety of carbon and nitrogen sources and was competent for mating 
and sporulation. The strain grew similarly to the wild-type in media with high and low 
osmolarity, and in assays testing sensitivity to various metals (cadmium, cobalt, copper). 

F. Fusion of Rnql (153-405) to Sup35 (124-685) - nonsense suppression 
25 phenotype 

The lack of an obvious loss-of-function phenotype was not unexpected, as 
the two known yeast prions, \URE3\ and {PSfl only exhibit phenotypes under unusual 
selectiye-conditions.— However^-the-absenee-of^a-phenotype presented-difficulties in 



determining whether Rnql could direct the epigenetic inheritance of a trait. To determine 
30 if the prion-like domain of Rnql could produce an epigenetic loss-of-function phenotype 
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we asked if it could replace the prion-determining domain of Sup35. When the wild-type 
Sup35 translation termination factor enters the prion state the loss-of-function phenotype 
it produces is nonsense suppression - the readthrough of stop codons. This phenotype can 
be conveniently assayed in the strain 74D-694 because it contains a UGA stop codon in 
5 the ADEl gene. In [psi'] 74D-694 cells, ribosomes efficiently terminate translation at this 
codon. Cells are therefore unable to grow on media lacking adenine (SD-ade), and 
colonies appear red on rich media due to the accumulation of a pigmented by-product, hi 
[PSt^ strains, sufficient readthrough occurs to support growth on SD-ade and prevent 
accumulation of the pigment on rich media. 

10 The coding region for amino acid residues 153-405 of Rnql (amino acid 

residues 153-405 of SEQ ID NO: 50) was substituted for 1-123 of Sup35 and the resulting 
fiision gene, RMQ was inserted into the genome in place of the endogenous SUP35 gene. 
RNQl, SUP 3 5 and its promoter were cloned by amplification of 74D-694 genomic DNA. 
The RNQl open reading frame was cloned using 5'-GGA GGA TCC ATG GAT ACG 

15 GAT AAG TTA ATC TCAG-3' (SEQ ID NO: 59) and (A) 5'-GGA CCG CGG GTA 

GCG GTT CTG TTG AGA AAA GTT GCC-3' (SEQ ID NO: 60). RNQl (153-405) was 
cloned using 5'-GA GGA TCC ATG CCT GAT GAT GAG GAA GAA GAC GAGG-3' 
(SEQ ID NO: 61) and (A). The SUP35 promoter was cloned using 5'-CG GAA TTC 
CTC GAG AAG ATA TCC ATC-3' (SEQ ID NO: 62) and 5'-G GGA TCC TGT TGC 

20 TAG TGG GCA GA-3'(SEQ ID NO: 63 ), SUP35 (124-685) was cloned using 5'-GTA 
CCG CGG ATG TCT TTG AAC GAC TTT CAA AAGC-3' (SEQ I DNO: 64) and 5'- 
GTG GAG CTC TTA CTC GGC AAT TTT AAC AAT TTT AC-3' (SEQ ED NO: 65) by 
PCR using the conditions described above in section D. 

The RMC gene replacement was performed as described in Rothstein, 

25 1991 . To create the plasmid for pop-in/pop-out replacement in pRS306 (available from 
ATCC), the SUP35 promoter was ligated into the EcoRI-BamHI site, RNQl (153-405) 
was ligated into the BamHI-SacII site, and SUP35 (124-685) was ligated into the SacH- 
SacI site. To create the disrupting fragment, this plasmid was linearized with Mlul and 
transfonried.— Pop-outs-were-seJeGted-on^-F0A-(Diagnostic-Ghemicals-Ltd7) and'verified~ 

30 by PCR. The resulting strain, RMC, had a growth rate similar to that of wild-type cells on 
YPD, although the accumulation of red pigment was not as intense as seen in ] 
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strains. RMC strains showed no growth on SD-ade even after 2 weeks of incubation). 
Thus, the protein encoded by the RMC gene (Rmc) fiilfilled the essential translational 
termination function of Sup35. 

At a low frequency, RMC variants appeared that were white on rich media 
5 and grew on SD-ade even more robustly than [PSr] cells did. The frequency at which 
these variants appeared (-10"^ ) was far greater than expected for reversion of the UGA 
stop codon mutation in adel-14, and subsequent analysis demonstrated that the allele had 
not reverted. The suppressor phenotype of these variants was comparable in stability to 
that of Because Sup35 proteins that lack residues 1-123 are incapable of making 

10 such conversions, these observations suggest that the Rnql prion-like domain can direct a 
prion conversion in the Rmc fusion protein. 

Transient over-expression of Sup35 can produce new [P57^] elements, 
because higher protein concentrations make it more likely that a prion conformation will 
be achieved. To test whether over-expression of Rmc can produce heritable suppressing 

1 5 variants, the original, non-suppressing RMC strain was transformed with an expression 
plasmid for RMC. These transformants showed a greatly elevated frequency of 
conversion to the suppressor state compared to control strains carrying the plasmid alone. 
Once a prion conformation is achieved it should be self-perpetuating and normal 
expression should then be sufficient for maintenance. When the RMC expression plasmid 

20 was lost all strains retained the suppressor phenotype. Thus, transient over-expression of 
Rmc produced a heritable change in the fidelity of translation termination. 



G. Non-Mendelian segregation of Rmc-based suppression phenotype 

To examine the genetic behavior of the suppressor phenotype in RMC 
strains, an isogenic a mating partner was created from a non-suppressing a RMC strain. 
25 When this strain was crossed to the original, non-suppressing, RMC strain, neither the 

diploids nor their haploid meiotic progeny exhibited the suppressor phenotype. However, 
when this strain was mated to RMC suppressor strains, the resulting diploids all displayed 

the_suppressor_phenotype,-demonstrating that-suppression-is dominant— In fourteen tetrads 

dissected from two different diploids of this cross, all four haploid progeny showed 
30 inheritance of the suppression phenotype, instead of the 2:2 segregation expected for a 
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phenotype encoded in the nuclear genome. Following convention, we henceforth refer to 
the dominant, non-Mendelian suppressor phenotype as [RPS^] (for Rnql [PiS7^]-like 
Suppression) and the non-suppressed phenotype as [rps ]. 

To determine if the dominant, non-Mendelian [T?/^!?^] phenotype arises 
5 from the ability of Rmc protein to form a prion, we tested it for two additional unusual 
genetic behaviors that are not expected for other non-Mendelian genetic elements, such as 
viruses or mitochondrial genomes. First, it should become recessive and Mendelian in 
crosses to strains carrying a wild-type Sup35 allele. This is because Sup35 lacks the 
Rnql sequences that would allow it to be incorporated into an [iiPiS^] prion. Wild-type 

10 Sup35, therefore, should cover the impaired translation-termination phenotype associated 
with the [RPS^] prion. However, even when this phenotype has disappeared, Rmc protein 
in the prion state should still convert new Rmc protein to the same state. Therefore, in 
haploid meiotic progeny of this diploid, the phenotype will reappear in segregants 
carrying the RMC gene, but not in segregants carrying the SUP35 gene (2:2 segregation). 

15 Indeed, diploids of a cross between an [/JPiS^] strain and an isogenic strain 

with a wild-type SUP35 gene did not exhibit a suppressor phenotype. Upon sporulation, 
suppression reappeared in only two of the four progeny. By PGR genotyping, these 
strains had the RMC gene at the SUP35 locus. Thus the [RPS^] factor had been preserved 
in the diploid, even though the phenotype had become cryptic, 

20 Second, maintenance of [TJPiS^] should depend upon continued expression 

of the Rmc protein. Although [7?PiS'^] is maintained in a cryptic state in diploids with a 
wild-type Sup35 gene, it should not be maintained in their haploid progeny whose only 
source of translational termination factor is wild-type Sup35. To determine if these 
progeny harbored the element in a cryptic state, they were mated to an [r/?^'] RMC 

25 strain whose protein would be converted if [RPS^] were still present. When this diploid 
was sporulated, none of the progeny exhibited the suppressor phenotype. Thus, the 
{RPS^^ element was not maintained in a cryptic state unless the Rmc protein was present. 

H. Curing_of rgP6^1 ^ ~ 

One of the hallmarks of yeast prions is that cells can be readily and 
30 reversibly cured of them. [PSt] is curable by several means, including growth on media 
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containing low concentrations of the protein denaturant guanidine hydrochloride and 
transient over-expression or deletion of the protein remodeling factor HSP104. 

Strains carrying [RPS^^ were passaged on medium containing 2.5 mM 
guanidine hydrochloride (GdnHCl) (Fluka) and then plated to YPD and to SD-ade to 
5 assay the suppressor phenotype. Cells passaged on GdnHCl no longer displayed the 

[RPS^] phenotype, while cells not treated with GdnHCl retained it. [RPS^] was also lost 
when the HSP104 gene was deleted by homologous recombination, performed using the 
same strategy as described above in section E, or when HSP104 was over expressed from 
a multicopy plasmid using the constitutive GPD promoter. Cells that had been cured of 

10 [RPS^^ by over-expression of HSP104 were passaged on YPD medium to isolate strains 
that had lost the over-expression plasmid. These strains remained [rps'^. Thus transient 
over-expression of HSP104 is sufficient to heritably cure cells of [RPS^^. 

Finally, we asked if Hspl04-mediated curing was reversible. Cells cured 
by over-expression of HSP104 were re-transformed with a plasmid bearing a single copy 

1 5 of KMC. To create the single-copy RMC plasmid in pRS3 1 6 (available from ATCC) the 
Clal-SacI fragment (includes promoter and RMC) friam the plasmid used above for the 
RMC gene replacement was ligated into the Clal-SacI site. Transformants were then 
plated onto SD-ade to assess the rate at which they converted to the [RPS^] suppressor 
phenotype. [RPS^] was regained at a rate comparable to that seen in the parental RMC 

20 strain, indicating that the transient over-expression of HSP104 caused no permanent 
alteration in susceptibility to [RPS^] conversion. 

I. Effect of endogenous Rnql upon \RPS''^ 

To determine if [TJPiS^] can act as an independent genetic element, the gene 
encoding the endogenous Rnql protein was deleted in strains carrying the RMC 
25 replacement of SUP35 using methods described above. The deletion had no effect upon 
the maintenance of the {RPS^} suppression phenotype. Growth on SD-ade was equally 
robust in [RPS^] and [RPS^] /^rnql strains. This indicates that Rmc can behave as an 
independent prion and is not dependent upon pre-existing Rnq Lin,an-insoluble state: " 




- 78- 

J. Physical state of the Rmc protein in \RPS^] and \rps'] strains 

Finally, we examined the localization of the Rmc fusion protein in the 
[RPS^] and [rps ] strains. Both strains were transformed with inducible plasmids that 
provided Rnql(153-405)-GFP expression that were constructed as described above in 
5 section A. Strains that lacked the endogenous Rnql gene were used to prevent the GPP 
marker from localizing to the endogenous Rnql aggregate. Short-term expression of the 
GFP-fusion protein prevented the formation of new [RPS^] elements in the [rps ] strain. 

Two distinct patterns of Rmc protein localization were revealed by this 
assay and these correlated with the phenotypic differences between [RPS^] and [rps ] 
10 strains. In the non-suppressing [rps ] strains, the Rnql(153-405)-GFP label was diffuse. 
In the suppressing [RPS^] strains, fluorescence was punctate, and was excluded from the 
nucleus. This punctate pattem was different from that observed with the endogenous 
Rnql aggregates, as Rmc aggregates are numerous and very small. 

Collectively, the foregoing experiments demonstrate that Rnql, which was 

1 5 identified based on sequence analysis, exhibits prion-like behaviour in numerous in vitro 
and in vivo assays. The search method used here shows that putative prions can be 
identified by a directed prion search rather than by the study of a pre-existing phenotype. 
In addition, this method will be applicable to the identification of prion proteins in many 
other organisms. Our demonstration that a new prion protein domain can substitute for 

20 that of another well-characterized prion, reproducing its phenotypic characteristics and 
epigenetic mode of inheritance, also provides a crucial tool in the analysis of 
uncharacterized candidates. 

We have shown that Rnql exists in distinct physical states - soluble and 
insoluble - in unrelated yeast strains. The insoluble state can be transmitted through 

25 cytoduction, and once transmitted is stably inherited. When the N-terminal prion- 

determining region of SUP35 was replaced with the C-terminal domain of RNQ J, the 
hybrid Rmc protein provided translation termination activity, mimicking the phenotype of 
[psi ] strains. At a low spontaneous frequency, the strain acquired_a_staye,JhLeritable- — — 

suppressor-phenotyper["/?/^S"']7 which mimicked the phenotype of [PSr] strains. 

30 Suppression was dominant and segregated to meiotic progeny in non-Mendelian ratios. 
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The possibility that this phenotype is caused by an epigenetic factor unrelated to the 
fusion protein was ruled out by genetic crosses showing that the phenotype is not 
expressed and can not be transmitted in strains that do not produce the fusion protein. 
The relationship of the suppression phenotype to protein conformation was further 
5 demonstrated by fluorescence localization of the hybrid protein in isogenic [RPS^] and 
[rps ] strains. In [RPS^] strains, most of the protein is sequestered into small foci and is 
presumably inhibited in its function in translational termination. Transient over- 
expression of Rmc greatly increased the frequency of conversion to [RPS^]. 

It is highly unusual for over-expression of a protein to cause a loss-of- 

10 function phenotype. It is even more unusual for phenotypes produced by over-expression 
to be stable after over-expression has ceased. Yet these properties are shared by the two 
yeast prion determinants and, to our knowledge, have been uniquely shared by them until 
now. They are believed to derive fi-om stabilization of an otherwise unstable protein 
conformation by protein-protein interactions. Proteins in the altered form then have the 

15 capacity to recruit new proteins of the same type to the same form. The phenotype 

associated with this change is, therefore, stably inherited fi"om generation to generation 
and transferred to mating partners in crosses. 

The ability of amino acid residues 153-405 of Rnql(SEQ ID NO: 50) to 
substitute for the N-terminal domain of Sup35 and recapitulate its prion behavior was by 

20 no means predictable. The C-terminal region of Rnql (residues 153-405) and the N- 

terminal region of Sup35 have no primary amino-acid sequence homology - only a similar 
enrichment in polar amino acids. Reconstituting the epigenetic behavior of a prion 
requires that the Rmc fusion protein achieve an unusual balance between solubility and 
aggregation. If the fusion protein is too likely to aggregate, the inactive state will be 

25 ubiquitous; if it is too likely to remain soluble, the inactive state will not be stable. To 
recapitulate the epigenetic behavior of [PSI+J the fusion protein must be able to switch 
from one state to the other and maintain either the inactive or the active state in a manner 
that is self perpetuating and highly stable from generation to generation. Even minor 
variations in the sequence of the N-terminal region of Sup35, including^eyeraLsingle — — 

30 amino-acid substitutions and-small'delefions, can prevent maintenance of the inactive 
state. And a small internal duplication destabilizes maintenance of the active state. 
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Therefore, the ability of the Rnql domain to substitute for the prion domain of Sup35 and 
to fully recapitulate its epigenetic behavior provides a rigorous test for its capacity to act 
as a prion and suggests that it has been honed through evolution to serve this function. 

The fusion of prion-determining regions with different functional proteins 
5 could be used to create a variety of recombinant proteins whose functions can be switched 
on or off in a heritable manner, both by nature and by experimental design. The two 
regions that constitute a prion, a functional domain and an epigenetic modifier of 
function, are modular and transferable. 

Example 8 

10 High-Throughput Assay to identify novel prion-like amyloidogenic sequences 

The procedures described in Example 5 are particularly usefial for identifying 
candidate prion-like sequences based on sequence characteristics and for screening these 
candidate sequences for useful prion-like properties. The following modification of those 
procedures provides a high-throughput genetic screen that is particularly useful for 

15 identifying sequences having prion-like properties firom any set of clones, including a set 
of uncharacterized clones, such as cDNA or genomic libraries. 

A library of short DNA fragments, such as genomic DNA fragments or 
cDNAs, is cloned in fi-ont of a sequence encoding the C-terminal domain of yeast Sup35 
to create a Hbrary of CSup35 chimeric constructs of the formula 5'-X-CSup35-3', 

20 wherein X is the candidate DNA fi*agment. Optionally, the 3' end of the construct 

encodes both the M and C domains of Sup35. This library is transformed into a [psi-] 
strain of yeast that carries Sup35 as a Ura+ plasmid (with its chromosomal Sup35 
deleted). Transformants are plated onto FOA-containing medium, which will cure the 
Ura+ plasmid so that the only functioning copy of Sup35 will be a fusion construct from 

25 the chimeric library. 

Viable transformants are transferred to a selective media to screen for 
transformants which can suppress nonsense codons in a [P57^]-like manner. For example, 

if the host cell is a yeast strain carrying a nonsense mutation in the ADEl gene,Jhe_ 

transformants are„screened-for-cells"that"ai^e^iable on a SD-ADE media. Cells that can 

30 survive via suppression of nonsense codons are selected for further analysis {e.g., as 
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described in preceding Examples), under the assumption that the library chimera has 
altered the function of Sup35. By using prion-specific tests such as histological 
examination for protein aggregates, curing, and Hspl04-dosage alteration, true 
aggregation-directing protein domains will be identified from original library of DNA 
constructs. The constructs which display prion-like properties can be used as described 
herein. Also, such constructs can be isolated and sequenced and used to identify and 
study the complete genes fi"om which they were derived, to see if the original gene/protein 
possesses prion properties in its native host. The foregoing assay also is useful for rapidly 
identifying fi-agments and variants of known prion-like proteins (NMSup35, NUre2, PrP, 
and so on) that retain prion-like properties. The assay, as well as chimeric constructs of 
the formula 5'-X-CSup35-3' and expression vectors containing such constructs, are 
considered additional aspects of the present invention. 

Example 9 

Fiber assembly mechanism of the prion-determining region (NM) of yeast Sup35p 

The investigation of specific protein aggregation is gaining an increasing 
role in conjunction with increasing numbers of human diseases characterized by altered 
protein structures, including prion-based encephalopathies, noninfectious 
neurodegenerative diseases, and systemic amyloidoses. Amyloid protein aggregates are 
P-sheet rich structures that form fibers in vitro and bind dyes such as CongoRed and 
ThioflavinT. Strikingly, most amyloids can promote the propagation of their own altered 
conformations, which is thought to be the basis of protein-mediated infectivity in prion 
diseases. This feature of protein self-propagation in amyloids may also be critical to 
disease progression in noninfectious amyloid diseases such as Alzheimer's or Parkinson's 
disease. A powerful system to study the molecular mechanism of amyloid propagation 
and specificity is the prion-like phenomenon [PSr] of Saccharomyces cerevisiae. 
Formation of higher ordered Sup35p complexes and the propagation of [PSI^] is caused 
by NM region of Sup35p. In vitro, both full-length Sup35p and NM form amyloid fibers 
with NM dictating the formation of the fiber axis while the C-terminal region_of,Sup35p- 
Jsjho.ught-to-be-loeated'on th^'periphery of the fibers. Detailed analysis by circular 
dichroism showed that NM adopts a mainly random coil structure in solution before it 
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changes slowly to a structure that is P-sheet-rich. This conformational conversion was 
shown to occur simultaneously to the formation of amyloid fibrils. 

In general, amyloid polymerization is considered to be a two-stage process 
initiated by the formation of a small nucleating seed or protofibril. Seed formation is 
5 thought to be oligomerization of soluble protein accompanied by a transition from a 
predominantly random coil to an amyloidogenic P-sheet conformation. Subsequent to 
nucleation, the seeds assemble with soluble protein to form the observed amyloid fibrils. 
The mechanisms for nucleation and fiber assembly are not well understood. 

Strikingly, the secondary structure of all proteins that form amyloid fibrils 
10 under physiological conditions is partially random coil in aqueous solutions. Such 
structure is usually significant for partially unfolded protein as found in folding 
intermediates. It is possible that this unique "high-energy" structure in solution is the 
driving force for fiber assembly of such proteins. Thereby, the fibrous aggregates might 
present the lowest energy conformer of these proteins. As a consequence, interference 
1 5 with their structural state in solution should influence their fiber assembly ability. This 
has been shown for Alzheimer's P-amyloid peptide, islet amyloid polypeptide, and the 
artificial peptide DAR16-IV, where changes in the secondary structure dramatically 
altered the fiber assembly process. 

The following experiments were performed to examine and characterize 
20 the folding and association pathway of soluble NM by starting with chemically denatured 
protein. Similar results were obtained with proteins isolated under non-denaturing 
conditions. These studies were facilitated by use of labeled cysteine-substituted NM 
mutants. A better understanding of the mechanisms of fiber assembly will facilitate 
manipulations of fiber growth under various conditions 

25 

A. Materials and methods 

Bacterial strains and culture 

Using pEMBL-Sup35p (an E, coli plasmid containing the Sup35 protein) 

as template, DNA enc oding NM was amplified b y PCR,with-various-linkers-for 

30 subcloning. For recombinant NM expression, the PGR products were subcloned as Ndel- 
BamHI fragments into pJC25. For GST-NM fusions, the PGR products were subcloned 
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as BamHI-EcoRI fragments into pGEX-2T (Pharmacia). For site-directed mutagenesis 
the protocol by Howorka and Bayley, Biotechniques, 25:764-766 (1998), was used for a 
high throughput cysteine scanning mutagenesis. A non-mutagenic primer pair for the P- 
lactamase gene and a mutagenic primer pair for each respective mutant were employed. 
In addition to generating a unique Nsil site, we used SphI and Nspl sites, which allows 
introduction of a cysteine codon in front of methionine and isoleucine or after alanine and 
threonine codons, to increase the number of mutants in our cysteine screen. The fidelity 
of each construct was confirmed by Sanger sequencing. Protein was expressed in E. coli 
BL21 [DE3] after inducing with ImM LPTG (OD^oonni of 0.6) at 25oC for 3 hours. 

Yeast strains and culture 

Using pJLI-Sup35pC-Sup35p as a template, DNA encoding each of the 
respective NM*"^ was amplified by PGR with two EcoRI sites for subcloning. To 
investigate the propagation and maintenance of [PST] by each NM*"^^ used, integrative 
constructs, constructed using the standard pRS series of vectors (available ft-om ATCC), 
were digested with Xbal and transformed into 74-D694 [PSF^ and \psi\ strains. 
Transformants were selected on uracil-deficient (SD-Ura) medium and confirmed by 
genomic PGR followed by digestion with Aatll, which cleaves the HA-tag between 
NM*^^^ and Sup35pG. Recombinant excision events were selected on medium containing 
5-fluoro-orotic acid. Only cells that have lost remaining integrative plasmids are able to 
grow on medium containing 5-fluoro-orotic acid. Again, replacements were confirmed by 
PGR followed by digestion with Aatll as described above. 

Protein purification 

NM and each NM^"^^ were purified after recombinant expression in E. 
coli by chromatography using Q-Sepharose (Pharmacia), hydroxyapatite (BioRad), and 
Poros HQ (Boehringer Mannheim) as a final step. All purification steps for NM or 
NM^"^^ were performed in the presence of 8M urea. GST-NM was purified by 
chromatography using Glutathione-Sepharose (Boehringer Manheim), Poros HQ 

(Boehri nger Mannheim ^ and-S-Sepharose-(-Pharmacia)-as-a-final-step— All-purification 

steps for GST-NM were performed in the presence of 50mM Arginine-HGl. Protein 
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concentrations were determined using the calculated extinction coefficient of 0.90 (NM, 
NM^^'^) or L23 (GST-NM) for a 1 mg/ml solution in a 1cm cuvette at 280nm. 
Secondary Structure Prediction 

Secondary structure of NM was predicted by using two independent 
5 prediction methods, GOR IV and Hierarchical Neural Network. Both methods were 
provided by Pole Bio-hiformatique Lyonnais. 

Secondary Structure Analysis 

CD spectra were obtained using a Jasco 715 spectropolarimeter equipped 
with a temperature control unit. All UV spectra were taken with a 0.1cm pathlength 

10 quartz cuvette (Hellma) in 5mM potassium phosphate (pH 7.4), 150mM NaCl and 
respective additives such as osmolytes in certain experiments. Protein concentration 
varied from 0.5|aM to 65|lM. Folding of chemically denatured NM or NM*^"^^ was 
monitored at 222 nm in time course experiments by diluting protein out of 8M Gdm*Cl 
(Guanidinium Hcl; final concentration 50mM) in the respective phosphate buffer. 

1 5 Thermal transition of NM or NM^^^ was performed with a heating/cooling increment of 
0.5oC/min. Spectra were recorded between 200nm and 250nm (2 accumulations). In a 
separate measurement, time courses were recorded for 30 sec at single wavelengths 
(208nm and 222nm) for each temperature and the mean value of each time course was 
determined. Temperature jump experiments were performed by incubating the sample in 

20 a water bath with the respective starting temperature for 30min. The cuvette was 

transferred to the spectropolarimeter already set to the final temperature and time courses 
were taken with a constant wavelength of 222rmi. Settings for wavelength scans: 
bandwidth, 5nm; response time, 0.25sec; speed, 20nm/min; accumulations, 4. All spectra 
were buffer-corrected. 

25 Fluorescent labeling ofNM^^^ 

The thiol-reactive fluorescent labels acrylodan and lANBD amide 
(Molecular Probes) were incubated with NM*"^^ for 2 hours at 25^0 according to the 
manufacturer's protocol. Remaining free label was removed by size exclusion 

chiwiatqgraphyjism 

30 efficiencies were determined by visible absorption using the extinction coefficients of 2 x 
10^ for acrylodan at 391nm and 2.5 x 10^ for lANBD 
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B. Construction and analysis of MM mutants 

To investigate the structural requirements for amyloid fiber assembly, we 
used yeast Sup35p's NM-region as a model protein. Until recently, fiber assembly 
kinetics of NM and other amyloid forming proteins have been monitored by binding of 
5 dyes such as CongoRed (CR) or ThioflavinT. To gain fiarther insight into NM folding 
and fiber assembly, a more sensitive method for detecting structural changes, such as that 
provided by intrinsic fluorescence, was necessary. As NM naturally lacks tryptophan, the 
only native amino acid with a reasonable environmental-sensitive fluorescence, site- 
directed mutagenesis could have been employed to artificially introduce tryptophan in 
10 NM. However, to improve experimental flexibility we introduced single cysteine 
substitutions throughout NM. Since NM naturally lacks cysteine, such single point 
;S mutations would allow probing of NM folding and assembly in a specific, well defined 

manner after cross-linking of fluorescent probes to the sulfhydryl-groups of cysteines. 
H= NM mutants with single cysteine replacements at amino acids throughout 

u I 1 5 NM that were predicted to be in structured regions or that were likely involved in the fiber 
; assembly process were constructed. These included the following fifteen mutants: NM^^^, 
g NM^^^^, NM^^^^, NM^"^^, NM^^^^, NM^^^^, NM^'^^^, NM^*'^^^, NM^»^^, NM'^^^^^, 

□ NM^»^'^, NM*'^^^^, NM^'^^^, NM^'^'^, and NM^''^^. As indicated in table 1 below, three of 

:S: the fifteen mutants, NM^^^*^, NM^^°*^, and NM^*^^*^, were not stably expressed at a 

Q 20 sufficiently high protein levels in E. coli. All other mutants were purified to homogeneity 
under denaturing conditions. To confirm that refolded NM attained a native protein 
structure, a GST-NM fiision protein was purified with thrombin, and GST was removed 
by binding to Glutathione-Sepharose. A structural comparison of refolded and native NM 
using far-UV circular dichroism (CD) showed no apparent differences between the two 
25 proteins. 
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TABLE 1 



NM 


Expression in 


Secondary Structure 


Fiber assembly 


Fiber morphology 


Protein 


E. coli 


[0222nm] 


(CR-binding) 


(EM) 


wild-type 


yes 


-2950 


yes 


smooth fibers up to 


(wt) NM 








35|lm long 




yes 


as wt 


as wt 


as wt 




not detectable 


_ 








yes 


as wt 


as wt 


as wt 




very low. 










not stable 










yes 


-6420 


slower assembly 


short fibers, only few 








rate 


are longer than 1 |ira 




yes 


-6250 


slower assembly 


short fibers, only few 








rate 


are longer than 1 p.m 


^M124C 


very low, 


- 


- 


- 




not stable 










yes 


-4570 


as wt 


as wt 




yes 


-4198 


as wt 


as wt 




yes 


as wt 


as wt 


as wt 




yes 


as wt 


as wt 


as vA. 




yes 


-4400 


as wt 


as 




yes 


-4000 


as wt 


less smooth, many 










short fibers 




yes 


-6410 


slower assembly 


many short fibers 








rate 






yes 


-3730 


no 


no detectable fibers 



To determine the direct influence of individual cysteine replacements on 
the folding and assembly of NM in vitro, the secondary structure of each NM^^ was 
compared_to-W-ild-type-NM-strueture by-far-^^ results are 

summarized in table 1 . Structurally, only NM^^^, NM*^^^*^, NM and NM*^^^^*^ were 
identical to wild-type NM. All other mutants contained a higher content of secondary 
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structure as indicated by an increased mean residue ellipiticity at [0] 222nnr NM and all 
Nmo-s^ with the exception of NM^^^^^, had identical mean residue ellipiticities at [0] 2osnm 

of 

-9000 degree cm^ dmol '. In contrast, NM^^^*^ had a decreased mean residue ellipiticity at 
[0] indicating that this mutant had an aberrant structure in comparison to wild-type 
5 NM than the other NM'^\ 

Next, fiber assembly of each mutant was performed on a roller drum and 
compared to wild-type NM assembly kinetics by binding of CongoRed (CR), which 
shows a spectral shift after interacting with amyloid fibers. Results form these 
experiments are summarized in table 1 . Only NM^^^^ did not bind CR under all 
10 conditions tested. NM*^^^^, NM^^^^, and NM^^^^*^ showed slightly altered CR-binding 
kinetics suggesting slower fiber assembly rates in comparison to wild-type NM. 

Electron microscopy (EM) was used to confirm that NM*^^^ fibers were 
"^t; morphologically identical to wild-type fibers. As indicated in table 1, the electron 

'H= micrographs showed no apparent differences in fiber density, fiber diameter, or other 

1 5 morphological features in comparison to wild-type NM for NM^^^, NM^^^^, NM*^*^^^, 
NM^'^^^, NM^^'^^, NM^*^'^, and NM*^^^^^. NM^^^^ fibers were not detectable by EM, 
Q suggesting that the apparent lack of CR-binding of NM^^^^ was not due to structural 

P differences in fibers that affected CR-binding. Results fi*om CD (secondary structure), 

'Sj CR-binding (fiber assembly kinetics), and EM (fiber morphology) indicate that the 

a 20 NM^^^, NM^^^^, NM^*^^^, and NM^^^^^ mutants display no apparent differences to wild- 
type NM with respect to these parameters. To further confirm that the chosen cysteine 
mutants were not influencing the principal properties of NM, genomic wild-type NM 
could be replaced by Nm*^^^ 

C. Covalent binding of fluorescent labels to NM *"^ 
25 Environmentally sensitive fluorescent probes, such as naphthalene 

derivatives or benzofurazans, are commonly used to detect conformational changes and 
assembly processes of proteins. Here, we made use of 6-acryloyl-2- 

dimethylaminonaphathlene (acrylodan)_aDd_A^,,A[!^dijnethyl--A^-^(-iodoac 



nitrobenz-2-oxa-l,3-diazol-4-yl)ethylene diamine (lANBD amide) both of which react 
30 specifically with free thiol-groups on proteins. Whereas acrylodan is very sensitive to its 
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structural environment, lANBD amide exhibits appreciable fluorescence when Hnked to 
buried or unsolvated thiols. Therefore, the latter fluorescence is highly sensitive to 
changes in the solvation level of the fluorophore as seen in folding events, whereas 
acrylodan is more powerful for investigating conformational changes of a protein. The 
5 specific labeling efficiencies of soluble NM*"^ were in the range of 0.40 to 0.78 (mol 
label/mol protein) with unspecific binding below 0.05 mol/mol for both fluorescent 
probes. 

After covalent binding to NM*^^^, the influence of the fluorescent labels on 
fiber assembly was investigated. No differences were found in fiber assembly for 7 

10 mutants (see table 1) in the presence of fluorescent labels in comparison to non-labeled 
protein as detected by CR-binding. No gross structural changes in assembled fibers were 
visible by EM for NM^^^S NM*'^^^^, NM^*^^, NM^^''^, NM^^^'^, and NM*'*^^^. In 
contrast, NM^^^ fibers labeled with both acrylodan and lANBD amide appeared rougher 
with an overall shorter length, although these changes were subtle. 

15 To determine the incorporation of labeled NM*"^^ into fibers, equal amounts 

of labeled and non-labeled protein were mixed. The amount of label in the soluble 
protein fi-action was detected over the course of fiber assembly. During the experiment, 
the label to protein ratio was constant indicating an equal incorporation of labeled and 
non-labeled protein into fibers. The resulting fibers were monitored for fluorescent 

20 emission of the respective label. Both measurements showed that fluorescent-labeled 

protein was sufficiently incorporated into amyloid fibers without influencing the assembly 
kinetics or the assembled state for NM^^^*^, NM''*^^^, NM^*^*^, NM^*^^^, NM^'^^^, and 



25 



The foregoing experiments examined the folding process of NM using 
NM*"^^ mutants that exhibited folding processes and structural characteristics similar to 
wild-type NM. These results provide a better understanding of the process of NM folding. 
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Example 10 

Bi-directional formation of fibers composed of the 
prion-determining region (NM) of yeast Sup35p 

The following experiments were performed to demonstrate that fibers composed of 
5 the NM region of Sup35p are capable of adding NM protein at both ends of the fiber. This 
was investigated using a mutant NM protein, in which the lysine residue at position 184 
was substituted by cysteine, that was capable of forming fibers labeled with specifically 
modified gold colloids. Visualization of the gold-labeled fibers allowed determination of 
the directionality of fiber growth. 

10 A. Determining the accessibility of cysteine residues in assembled fibers 

First, the accessibility of cysteine residues was assayed in fibers composed 
of cysteine-substituted mutant NM (NM*'^)proteins, each of which carried different single 
cysteine replacements at amino acid residues throughout the NM protein. All Nm*^^, 
described in Example 9 above, that formed fibers were examined. For fiber assembly, 

15 NM'^^^ protein was diluted out of 4M Gdm*Cl 80-fold into 5 mM potassium phosphate (pH 
7.4), 150 mM NaCl to yield a final NM*"^^ protein concentration of 10 |aM. To accelerate 
the rate of fiber assembly, all NM''^ proteins were incubated on a roller drum (9 rpm) for 
12 hours. The resulting fibers were sonicated with a Sonic Dismembrator Model 302 
(Artek) using an intermediate tip for 15 seconds. Sonication resulted in small sized fibers 

20 that did not reassemble to larger fibers as determined by electron microscopy (EM). 

Seeding of fiber assembly was performed by addition of 1% (v/v) of the sonicated fibers to 
soluble NM'^^^ protein. 

To test the accessibility of cysteines in assembled fibers composed of NM""^ 
proteins, EZ-link PEO-maleimide-conjugated biotin (Pierce, product number 21901) was 

25 added to the assembled fibers and the labeling efficiency of the biotin was assayed. EZ- 
link PEO-maleimide-conjugated biotin was covalently linked to assembled NM*"^^ fibers 
for 2 hours at 25°C according to the manufacturer's protocol (protocol number 0748). 

Remaining free biotin was removed b y size excl usion^hromatography-using-D-Salt 

Excellulose desalting columns (Pierce, product number 20450). Labeling efficiency was 

30 determined by competing for avidin binding between biotin and [2-(4'-hydroxybenzene)] 
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benzoic acid (HABA). The binding of HABA to avidin results in a specific absorption 
band at 500 nm. Since biotin displaces the HABA dye due to higher affinity of biotin for 
avidin, as compared to that of HABA dye for avidin, the binding of HABA to avidin and 
thus the specific absorption at 500 nm decreases proportionately when biotin is added to 
5 the reaction. Results from this assay indicated that fibers composed of either NM*"^^ 

proteins in which the lysine residue at position 184 was substituted by a cysteine residue 
(K184C) or NM*^^ proteins in which the serine residue at position 2 was substituted by a 
cysteine residue (S2C), bound a detectable amount of biotin. S2C fibers had a labeling 
efficiency of 0.16 mol biotin/mol protein, and K184C fibers exhibited a labeling efficiency 
10 of 0.56 mol biotin/mol protein. Thus, the cysteine residue at position 184 is highly 

accessible and the cysteine residue at position 2 is partially accessible on the surface of 
assembled fibers. 

B. Analysis of fiber growth using EM 

K184C sonicated fibers were tested for their ability to seed fiber assembly 

15 of soluble wild-type NM protein. Fiber assembly was performed as described above using 
sonicated K184C fibers as seeds to assemble soluble wild-type NM protein. The rate of 
fiber assembly was assayed by CongoRed binding (CR-binding) and fiber morphology was 
examined by EM. For EM studies, protein solutions were negatively stained as previously 
described in Spiess et al., 1987, Electron Microscopy and Molecular Biology: A Practical 

20 Approach^ Oxford Press, p. 147- 166. Images were obtained with a CM 120 Transmission 
Electron Microscope (Phillips) with an LaB6 filament, operating at 120 V in low dose 
mode at a magnification of 4500x and recorded on Kodak SOI 63 film. Results fi*om CR- 
binding and EM experiments show that K184C fibers are able to seed wild-type NM fiber 
assembly. The resulting mixed K184C/NM fibers showed no apparent differences in 

25 assembly rate or morphology to fibers seeded with sonicated wild-type NM fibers. Similar 
results were obtained when biotinylated K184C seeds were used fro fiber assembly. 

The surface exposure of the cysteine at position 184 in assembled fibers 

composed of the K184C mutant protein allowed sufficient labelin g_oXfibers-with 

specifically modified gold colloids. Monomaleimido Nanogold '^^^ (Nanoprobes, product 

30 number 2020A) with a particle diameter of 1 .4 nm was covalently cross-linked to the 
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sulfhydryl group of accessible cysteine residues in sonicated K184C fibers for 18 hours at 
4°C according to the manufacturer's protocol. Remaining free Nanogold™ was removed 
by a repeated size exclusion chromatography using D-Salt Excellulose desalting columns 
(Pierce, product number 20450). The extent of labeling was determined by UV/visible 
5 absorption using extinction coefficients for Nanogold"^^ of 2.25 x 10^ at 280 nm and 1.12 
X 10^ at 420 nm. Ratios of optical densities at 280 nm and 420 nm allowed an 
approximation of the labeling efficiency. These gold-labeled fibers were employed to seed 
fiber growth of soluble wild-type NM protein. 

To visualize the 104 nm Nanogold™ particles attached to the assembled 

10 mixed K184C/NM fibers, we used Goldenhance™ (Nanoprobes) according to the 

manufacturer's instructions. Briefly, equal volumes of enhancer (Solution A) and activator 
(Solution B) were combined and incubated for 15 min at room temperature, hiitiator 
(Solution C) was then added at a volume equal to that of enhancer or activator, and the 
resulting mixture was diluted (1 :2) with phosphate buffer (Solution D). The final solution 

15 acts as an enhancing reagent by selectively depositing gold onto Nanogold ™ particles, 
thereby providing enlargement of Nanogold™ to give electron-dense enlarged Nanogold 
™ particles in the electron microscope. For negative staining of gold-labeled fibers, 6 \i\ 
of protein (8 |aM, 1% (w/w) gold labeled seed) were applied to a 400 mesh carbon-coated 
copper grid (Ted Pella) for 45 seconds. After washing with 100 [il phosphate buffer, grids 

20 were incubated with the final Goldenhance'^^ enhancing reagent, prepared as described 
above, for 5 min. After washing with 200 |-ll glass-distilled water, negative staining was 
employed as in Spiess et al., 1987 Electron Microscopy and Molecular Biology: A 
Practical Approach, Oxford Press, p. 147-1 66. EM results revealed that the gold-labeled 
K184C regions are located in the middle of the assembled K184C/NM fibers indicating bi- 

25 directional fiber assembly with no apparent polarity in the seeds used. 

The foregoing experiments show that fiber assembly of NM proteins occurs 
at both ends of the fibers. These analyses were performed using Kl 84C, a NM''^ mutant 

wherein the lysine residue at position 184 has been substituted with^axysteine residuer 

^Experiments by biotin-labeling of the cysteine residues on assembled K184C fibers were 
30 carried out to determine accessibility of the cysteines. Since wild-type NM protein does 
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not contain any cysteine residues, labeling can only occur at position 184. Results show 
that position 184 is highly accessible in assembled K184C fibers. The ability of 
specifically modified gold colloids to covalently cross-link the sulfhydryl group of 
cysteines enabled generation of gold-labeled fibers that can be visualized by EM. 
Examination of fiber assembly, by taking advantage of the ability of K184C to produce 
gold-labeled fibers, indicates that fiber growth occurs bi-directionally. It fiarther indicates 
that fibers with specific modifications and attachments, a single fiber containing modified 
and unmodified regions, and mixtures of modified and unmodified fibers can be produced. 

While the present invention has been described in terms of specific 
embodiments, it is understood that variations and modifications will occur to those in the 
art, all of which are intended as aspects of the present invention. Accordingly, only such 
limitations as appear in the claims should be placed on the invention. 



